Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
The cheese-grater Mac Pro is no more, but Apple will still sell you an old one

The cheese-grater Mac Pro is no more, but Apple will still sell you an old one

28 March 2026
Research finds generative AI making frauds a cakewalk for bad actors

Research finds generative AI making frauds a cakewalk for bad actors

28 March 2026
M5 MacBook Pro tests show Apple is pretty close to fixing its worst weakness

M5 MacBook Pro tests show Apple is pretty close to fixing its worst weakness

28 March 2026
Facebook X (Twitter) Instagram
Just In
  • The cheese-grater Mac Pro is no more, but Apple will still sell you an old one
  • Research finds generative AI making frauds a cakewalk for bad actors
  • M5 MacBook Pro tests show Apple is pretty close to fixing its worst weakness
  • Sony is halting sales of memory cards and you have AI to blame for it
  • I see Apple skipping the AI hellfire, but shaping Siri as the most flexible assistant
  • March Madness, Revisited: The AI Model Did Well. But Mad Things Still Happen
  • Apple announces new sci-fi film Liminal and I can’t wait for it
  • What Is the Best Garmin Watch Right Now? (2026)
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » OpenAI Touts New AI Safety Research. Critics Say It’s a Good Step, but Not Enough
News

OpenAI Touts New AI Safety Research. Critics Say It’s a Good Step, but Not Enough

News RoomBy News Room17 July 20244 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
OpenAI Touts New AI Safety Research. Critics Say It’s a Good Step, but Not Enough
Share
Facebook Twitter LinkedIn Pinterest Email

OpenAI has faced opprobrium in recent months from those who suggest it may be rushing too quickly and recklessly to develop more powerful artificial intelligence. The company appears intent on showing it takes AI safety seriously. Today it showcased research that it says could help researchers scrutinize AI models even as they become more capable and useful.

The new technique is one of several ideas related to AI safety that the company has touted in recent weeks. It involves having two AI models engage in a conversation that forces the more powerful one to be more transparent, or “legible,” with its reasoning so that humans can understand what it’s up to.

“This is core to the mission of building an [artificial general intelligence] that is both safe and beneficial,” Yining Chen, a researcher at OpenAI involved with the work, tells WIRED.

So far, the work has been tested on an AI model designed to solve simple math problems. The OpenAI researchers asked the AI model to explain its reasoning as it answered questions or solved problems. A second model is trained to detect whether the answers are correct or not, and the researchers found that having the two models engage in a back and forth encouraged the math-solving one to be more forthright and transparent with its reasoning.

OpenAI is publicly releasing a paper detailing the approach. “It’s part of the long-term safety research plan,” says Jan Hendrik Kirchner, another OpenAI researcher involved with the work. “We hope that other researchers can follow up, and maybe try other algorithms as well.”

Transparency and explainability are key concerns for AI researchers working to build more powerful systems. Large language models will sometimes offer up reasonable explanations for how they came to a conclusion, but a key concern is that future models may become more opaque or even deceptive in the explanations they provide—perhaps pursuing an undesirable goal while lying about it.

The research revealed today is part of a broader effort to understand how large language models that are at the core of programs like ChatGPT operate. It is one of a number of techniques that could help make more powerful AI models more transparent and therefore safer. OpenAI and other companies are exploring more mechanistic ways of peering inside the workings of large language models, too.

OpenAI has revealed more of its work on AI safety in recent weeks following criticism of its approach. In May, WIRED learned that a team of researchers dedicated to studying long-term AI risk had been disbanded. This came shortly after the departure of cofounder and key technical leader Ilya Sutskever, who was one of the board members who briefly ousted CEO Sam Altman last November.

OpenAI was founded on the promise that it would make AI both more transparent to scrutiny and safer. After the runaway success of ChatGPT and more intense competition from well-backed rivals, some people have accused the company of prioritizing splashy advances and market share over safety.

Daniel Kokotajlo, a researcher who left OpenAI and signed an open letter criticizing the company’s approach to AI safety, says the new work is important, but incremental, and that it does not change the fact that companies building the technology need more oversight. “​The situation we are in remains unchanged,” he says. “Opaque, unaccountable, unregulated corporations racing each other to build artificial superintelligence, with basically no plan for how to control it.”

Another source with knowledge of OpenAI’s inner workings, who asked not to be named because they were not authorized to speak publicly, says that outside oversight of AI companies is also needed. “The question is whether they’re serious about the kinds of processes and governance mechanisms you need to prioritize societal benefit over profit,” the source says. “Not whether they let any of their researchers do some safety stuff.”

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleNintendo Is Raising The Famicom Detective Club Series From The Dead With Emio – The Smiling Man
Next Article Looking for a cheap student laptop deal during Prime Day? How about $139?

Related Articles

The cheese-grater Mac Pro is no more, but Apple will still sell you an old one
News

The cheese-grater Mac Pro is no more, but Apple will still sell you an old one

28 March 2026
Research finds generative AI making frauds a cakewalk for bad actors
News

Research finds generative AI making frauds a cakewalk for bad actors

28 March 2026
M5 MacBook Pro tests show Apple is pretty close to fixing its worst weakness
News

M5 MacBook Pro tests show Apple is pretty close to fixing its worst weakness

28 March 2026
Sony is halting sales of memory cards and you have AI to blame for it
News

Sony is halting sales of memory cards and you have AI to blame for it

28 March 2026
I see Apple skipping the AI hellfire, but shaping Siri as the most flexible assistant
News

I see Apple skipping the AI hellfire, but shaping Siri as the most flexible assistant

28 March 2026
March Madness, Revisited: The AI Model Did Well. But Mad Things Still Happen
News

March Madness, Revisited: The AI Model Did Well. But Mad Things Still Happen

28 March 2026
Demo
Top Articles
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024132 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024111 Views
Costco partners with Electric Era to bring back EV charging in the U.S.

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 2024100 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
March Madness, Revisited: The AI Model Did Well. But Mad Things Still Happen News

March Madness, Revisited: The AI Model Did Well. But Mad Things Still Happen

News Room28 March 2026
Apple announces new sci-fi film Liminal and I can’t wait for it News

Apple announces new sci-fi film Liminal and I can’t wait for it

News Room28 March 2026
What Is the Best Garmin Watch Right Now? (2026) News

What Is the Best Garmin Watch Right Now? (2026)

News Room28 March 2026
Most Popular
The Spectacular Burnout of a Solar Panel Salesman

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025137 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024132 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024111 Views
Our Picks
Sony is halting sales of memory cards and you have AI to blame for it

Sony is halting sales of memory cards and you have AI to blame for it

28 March 2026
I see Apple skipping the AI hellfire, but shaping Siri as the most flexible assistant

I see Apple skipping the AI hellfire, but shaping Siri as the most flexible assistant

28 March 2026
March Madness, Revisited: The AI Model Did Well. But Mad Things Still Happen

March Madness, Revisited: The AI Model Did Well. But Mad Things Still Happen

28 March 2026

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2026 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.