Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On

Samsung Galaxy Z Fold 7 Design Spotted in Leaked Hands-On Images Ahead of July 9 Launch

4 July 2025

Apple MacBook Pro With M5 Chip to Launch This Year; 15 Mac Computers in Development: Report

4 July 2025

The Best Video Games of 2025 So Far (Feat. John Carson)

4 July 2025
Facebook X (Twitter) Instagram
Just In
  • Samsung Galaxy Z Fold 7 Design Spotted in Leaked Hands-On Images Ahead of July 9 Launch
  • Apple MacBook Pro With M5 Chip to Launch This Year; 15 Mac Computers in Development: Report
  • The Best Video Games of 2025 So Far (Feat. John Carson)
  • The Best Video Games of 2025 (So Far) | The Game Informer Show
  • Tecno Pova 7 5G Series Launching Today: Expected Features and Specifications
  • The Person in Charge of Testing Tech for US Spies Has Resigned
  • How Gearbox Designed The Open World Of Borderlands 4
  • Trump’s Defiance of TikTok Ban Prompted Immunity Promises to 10 Tech Companies
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project
News

These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project

News RoomBy News Room1 December 20233 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email

There are other clues to what Q* could be. The name may be an allusion to Q-learning, a form of reinforcement learning that involves an algorithm learning to solve a problem through positive or negative feedback, which has been used to create game-playing bots and to tune ChatGPT to be more helpful. Some have suggested that the name may also be related to the A* search algorithm, widely used to have a program find the optimal path to a goal.

The Information throws another clue into the mix: “Sutskever’s breakthrough allowed OpenAI to overcome limitations on obtaining enough high-quality data to train new models,” its story says. “The research involved using computer-generated [data], rather than real-world data like text or images pulled from the internet, to train new models.” That appears to be a reference to the idea of training algorithms with so-called synthetic training data, which has emerged as a way to train more powerful AI models.

Subbarao Kambhampati, a professor at Arizona State University who is researching the reasoning limitations of LLMs, thinks that Q* may involve using huge amounts of synthetic data, combined with reinforcement learning, to train LLMs to specific tasks such as simple arithmetic. Kambhampati notes that there is no guarantee that the approach will generalize into something that can figure out how to solve any possible math problem.

For more speculation on what Q* might be, read this post by a machine-learning scientist who pulls together the context and clues in impressive and logical detail. The TLDR version is that Q* could be an effort to use reinforcement learning and a few other techniques to improve a large language model’s ability to solve tasks by reasoning through steps along the way. Although that might make ChatGPT better at math conundrums, it’s unclear whether it would automatically suggest AI systems could evade human control.

That OpenAI would try to use reinforcement learning to improve LLMs seems plausible because many of the company’s early projects, like video-game-playing bots, were centered on the technique. Reinforcement learning was also central to the creation of ChatGPT, because it can be used to make LLMs produce more coherent answers by asking humans to provide feedback as they converse with a chatbot. When WIRED spoke with Demis Hassabis, the CEO of Google DeepMind, earlier this year, he hinted that the company was trying to combine ideas from reinforcement learning with advances seen in large language models.

Rounding up the available clues about Q*, it hardly sounds like a reason to panic. But then, it all depends on your personal P(doom) value—the probability you ascribe to the possibility that AI destroys humankind. Long before ChatGPT, OpenAI’s scientists and leaders were initially so freaked out by the development of GPT-2, a 2019 text generator that now seems laughably puny, that they said it could not be released publicly. Now the company offers free access to much more powerful systems.

OpenAI refused to comment on Q*. Perhaps we will get more details when the company decides it’s time to share more results from its efforts to make ChatGPT not just good at talking but good at reasoning too.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleSamsung Galaxy S24 Series RAM Variants Tipped; Phones Reportedly Spotted on FCC Site
Next Article How to Download the iOS 17.0.3 Update to Resolve the iPhone 15 Pro Overheating Issue

Related Articles

News

The Person in Charge of Testing Tech for US Spies Has Resigned

4 July 2025
News

Trump’s Defiance of TikTok Ban Prompted Immunity Promises to 10 Tech Companies

4 July 2025
News

The 61 Best Early Amazon Prime Day Deals

3 July 2025
News

A Game Called Date Everything Literally Lets You Date Everything—Except People

3 July 2025
News

Trump’s ‘Big Beautiful Bill’ Would Leave Millions Without Health Insurance

3 July 2025
News

Trump Officials Want to Prosecute Over the ICEBlock App. Lawyers Say That’s Unconstitutional

3 July 2025
Demo
Top Articles

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024100 Views

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202495 Views

Oppo Reno 14, Reno 14 Pro India Launch Timeline and Colourways Leaked

27 May 202581 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
News

The Person in Charge of Testing Tech for US Spies Has Resigned

News Room4 July 2025
Gaming

How Gearbox Designed The Open World Of Borderlands 4

News Room4 July 2025
News

Trump’s Defiance of TikTok Ban Prompted Immunity Promises to 10 Tech Companies

News Room4 July 2025
Most Popular

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025124 Views

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024100 Views

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 202495 Views
Our Picks

The Best Video Games of 2025 (So Far) | The Game Informer Show

4 July 2025

Tecno Pova 7 5G Series Launching Today: Expected Features and Specifications

4 July 2025

The Person in Charge of Testing Tech for US Spies Has Resigned

4 July 2025

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2025 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.