Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
The Instant Smear Campaign Against Border Patrol Shooting Victim Alex Pretti

The Instant Smear Campaign Against Border Patrol Shooting Victim Alex Pretti

25 January 2026
New study shows AI isn’t ready for office work

New study shows AI isn’t ready for office work

25 January 2026
ICE Asks Companies About ‘Ad Tech and Big Data’ Tools It Could Use in Investigations

ICE Asks Companies About ‘Ad Tech and Big Data’ Tools It Could Use in Investigations

24 January 2026
Facebook X (Twitter) Instagram
Just In
  • The Instant Smear Campaign Against Border Patrol Shooting Victim Alex Pretti
  • New study shows AI isn’t ready for office work
  • ICE Asks Companies About ‘Ad Tech and Big Data’ Tools It Could Use in Investigations
  • This is the tech that makes Volvo’s latest EV a major step forward
  • This Autonomous Aquatic Robot Is Smaller Than a Grain of Salt
  • Gear News of the Week: Apple’s AI Wearable and a Phone That Can Boot Android, Linux, and Windows
  • Best Portable Blenders of 2026: Ninja, Nutribullet, Beast
  • The Best Cheap Gaming Laptops Actually Worth Buying
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » How Game Theory Can Make AI More Reliable
News

How Game Theory Can Make AI More Reliable

News RoomBy News Room9 June 20244 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
How Game Theory Can Make AI More Reliable
Share
Facebook Twitter LinkedIn Pinterest Email

Posing a far greater challenge for AI researchers was the game of Diplomacy—a favorite of politicians like John F. Kennedy and Henry Kissinger. Instead of just two opponents, the game features seven players whose motives can be hard to read. To win, a player must negotiate, forging cooperative arrangements that anyone could breach at any time. Diplomacy is so complex that a group from Meta was pleased when, in 2022, its AI program Cicero developed “human-level play” over the course of 40 games. While it did not vanquish the world champion, Cicero did well enough to place in the top 10 percent against human participants.

During the project, Jacob—a member of the Meta team—was struck by the fact that Cicero relied on a language model to generate its dialog with other players. He sensed untapped potential. The team’s goal, he said, “was to build the best language model we could for the purposes of playing this game.” But what if instead they focused on building the best game they could to improve the performance of large language models?

Consensual Interactions

In 2023, Jacob began to pursue that question at MIT, working with Yikang Shen, Gabriele Farina, and his adviser, Jacob Andreas, on what would become the consensus game. The core idea came from imagining a conversation between two people as a cooperative game, where success occurs when a listener understands what a speaker is trying to convey. In particular, the consensus game is designed to align the language model’s two systems—the generator, which handles generative questions, and the discriminator, which handles discriminative ones.

After a few months of stops and starts, the team built this principle up into a full game. First, the generator receives a question. It can come from a human or from a preexisting list. For example, “Where was Barack Obama born?” The generator then gets some candidate responses, let’s say Honolulu, Chicago, and Nairobi. Again, these options can come from a human, a list, or a search carried out by the language model itself.

But before answering, the generator is also told whether it should answer the question correctly or incorrectly, depending on the results of a fair coin toss.

If it’s heads, then the machine attempts to answer correctly. The generator sends the original question, along with its chosen response, to the discriminator. If the discriminator determines that the generator intentionally sent the correct response, they each get one point, as a kind of incentive.

If the coin lands on tails, the generator sends what it thinks is the wrong answer. If the discriminator decides it was deliberately given the wrong response, they both get a point again. The idea here is to incentivize agreement. “It’s like teaching a dog a trick,” Jacob explained. “You give them a treat when they do the right thing.”

The generator and discriminator also each start with some initial “beliefs.” These take the form of a probability distribution related to the different choices. For example, the generator may believe, based on the information it has gleaned from the internet, that there’s an 80 percent chance Obama was born in Honolulu, a 10 percent chance he was born in Chicago, a 5 percent chance of Nairobi, and a 5 percent chance of other places. The discriminator may start off with a different distribution. While the two “players” are still rewarded for reaching agreement, they also get docked points for deviating too far from their original convictions. That arrangement encourages the players to incorporate their knowledge of the world—again drawn from the internet—into their responses, which should make the model more accurate. Without something like this, they might agree on a totally wrong answer like Delhi, but still rack up points.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleI’m sorry in advance to all my future Marvel Rivals teammates
Next Article I saw an absurd game about rabbits at Summer Game Fest, and I’m obsessed with it

Related Articles

The Instant Smear Campaign Against Border Patrol Shooting Victim Alex Pretti
News

The Instant Smear Campaign Against Border Patrol Shooting Victim Alex Pretti

25 January 2026
New study shows AI isn’t ready for office work
News

New study shows AI isn’t ready for office work

25 January 2026
ICE Asks Companies About ‘Ad Tech and Big Data’ Tools It Could Use in Investigations
News

ICE Asks Companies About ‘Ad Tech and Big Data’ Tools It Could Use in Investigations

24 January 2026
This is the tech that makes Volvo’s latest EV a major step forward
News

This is the tech that makes Volvo’s latest EV a major step forward

24 January 2026
This Autonomous Aquatic Robot Is Smaller Than a Grain of Salt
News

This Autonomous Aquatic Robot Is Smaller Than a Grain of Salt

24 January 2026
Gear News of the Week: Apple’s AI Wearable and a Phone That Can Boot Android, Linux, and Windows
News

Gear News of the Week: Apple’s AI Wearable and a Phone That Can Boot Android, Linux, and Windows

24 January 2026
Demo
Top Articles
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024107 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024101 Views
Costco partners with Electric Era to bring back EV charging in the U.S.

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202497 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
Gear News of the Week: Apple’s AI Wearable and a Phone That Can Boot Android, Linux, and Windows News

Gear News of the Week: Apple’s AI Wearable and a Phone That Can Boot Android, Linux, and Windows

News Room24 January 2026
Best Portable Blenders of 2026: Ninja, Nutribullet, Beast News

Best Portable Blenders of 2026: Ninja, Nutribullet, Beast

News Room24 January 2026
The Best Cheap Gaming Laptops Actually Worth Buying News

The Best Cheap Gaming Laptops Actually Worth Buying

News Room24 January 2026
Most Popular
The Spectacular Burnout of a Solar Panel Salesman

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025136 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024107 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024101 Views
Our Picks
This is the tech that makes Volvo’s latest EV a major step forward

This is the tech that makes Volvo’s latest EV a major step forward

24 January 2026
This Autonomous Aquatic Robot Is Smaller Than a Grain of Salt

This Autonomous Aquatic Robot Is Smaller Than a Grain of Salt

24 January 2026
Gear News of the Week: Apple’s AI Wearable and a Phone That Can Boot Android, Linux, and Windows

Gear News of the Week: Apple’s AI Wearable and a Phone That Can Boot Android, Linux, and Windows

24 January 2026

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2026 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.