Best in Technology

OpenAI Offers a Peek Inside the Guts of ChatGPT

By News Room · 6 June 2024 · 3 Mins Read

ChatGPT developer OpenAI’s approach to building artificial intelligence came under fire this week from former employees who accuse the company of taking unnecessary risks with technology that could become harmful.

Today OpenAI released a new research paper apparently aimed at showing it is serious about tackling AI risk by making its models more explainable. In the paper, researchers from the company lay out a way to peer inside the AI model that powers ChatGPT. They devised a way to identify how it stores certain concepts, including those that might cause an AI system to misbehave.

Although the research makes OpenAI’s work on keeping AI in check more visible, it also highlights recent turmoil at the company. The new research was performed by the recently disbanded “superalignment” team at OpenAI that was dedicated to studying the long-term risks posed by the technology.

The former group’s coleads Ilya Sutskever and Jan Leike, both of whom have left OpenAI, are named as coauthors. Sutskever, a cofounder of the company and formerly chief scientist, was among the board members who voted to fire OpenAI CEO Sam Altman in November 2023, triggering a chaotic few days that culminated in Altman’s return as leader.

ChatGPT is powered by a family of so-called large language models called GPT, based on an approach to machine learning known as artificial neural networks. These mathematical networks have shown great power to learn useful tasks by analyzing example data, but their workings cannot be scrutinized as easily as those of conventional computer programs. The complex interplay between the layers of “neurons” within an artificial neural network makes it hugely challenging to reverse engineer why a system like ChatGPT came up with a particular response.

“Unlike with most human creations, we don’t really understand the inner workings of neural networks,” the researchers behind the work write in an accompanying blog post. Some prominent AI researchers believe that the most powerful AI models, including ChatGPT, could be used to design chemical or biological weapons and coordinate cyber attacks. A longer-term concern is that AI models may choose to hide information or act in harmful ways in order to achieve their goals.

OpenAI’s new paper outlines a technique that lessens the mystery a little, by identifying patterns that represent specific concepts inside a machine learning system with help from an additional machine learning model. The key innovation is a refinement of that additional network, the one used to peer inside the system of interest and identify concepts, which makes it more efficient.
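The article does not name the additional model. As a rough illustration only, the sketch below assumes it is a sparse autoencoder with a top-k activation, a common choice in interpretability work: an internal activation vector is expanded into many candidate "features," only the strongest few are kept, and the activation is rebuilt from them. The sizes, the random untrained weights, and every function name here are hypothetical and are not taken from OpenAI's code.

```python
import random

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def top_k(values, k):
    """Keep the k largest values (clipped at zero) and zero out the rest."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    keep = set(order[:k])
    return [max(v, 0.0) if i in keep else 0.0 for i, v in enumerate(values)]

def encode(activation, encoder, k):
    """Map a model activation to a sparse vector of feature strengths."""
    return top_k(matvec(encoder, activation), k)

def decode(features, decoder):
    """Rebuild an approximation of the activation from the sparse features."""
    return matvec(decoder, features)

# Hypothetical toy sizes; real systems are far larger.
D_MODEL, N_FEATURES, K = 4, 8, 2

rng = random.Random(0)
# Untrained random weights, for illustration only. In practice they are
# learned so that the reconstruction matches the original activation.
encoder = [[rng.gauss(0, 1) for _ in range(D_MODEL)] for _ in range(N_FEATURES)]
# Assumption: decoder is the transpose of the encoder (tied weights).
decoder = [[encoder[j][i] for j in range(N_FEATURES)] for i in range(D_MODEL)]

activation = [rng.gauss(0, 1) for _ in range(D_MODEL)]
features = encode(activation, encoder, K)
reconstruction = decode(features, decoder)

# Indices of the features that "fired" for this activation.
active = [i for i, f in enumerate(features) if f != 0.0]
print("active features:", active)
```

After training, each surviving feature index would ideally line up with a human-recognizable concept, which is what makes the sparse representation easier to inspect than the raw activation.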

OpenAI proved out the approach by identifying patterns that represent concepts inside GPT-4, one of its largest AI models. The company released code related to the interpretability work and a visualization tool that can be used to see how the words in different sentences activate concepts including profanity and erotic content in GPT-4 and another model. Knowing how a model represents certain concepts could be a step towards being able to dial down those associated with unwanted behavior, to keep an AI system on the rails. It could also make it possible to tune an AI system to favor certain topics or ideas.
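The idea of "dialing down" a concept can be pictured with a small sketch. It assumes, hypothetically, that a concept corresponds to a single direction in the model's activation space, and it scales the part of an activation that lies along that direction. The article does not say OpenAI does this; the function name `dial_down` and the toy vectors are invented for illustration.

```python
def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def dial_down(activation, direction, scale):
    """Scale the component of `activation` along `direction`.

    scale=1.0 leaves the activation unchanged; scale=0.0 removes the
    concept's component entirely. Hypothetical helper, not a real API.
    """
    norm_sq = dot(direction, direction)
    if norm_sq == 0:
        return list(activation)
    coeff = dot(activation, direction) / norm_sq
    return [a - (1.0 - scale) * coeff * d for a, d in zip(activation, direction)]

# Toy example: the "concept" is assumed to live along the first axis.
activation = [2.0, 1.0, 0.0]
direction = [1.0, 0.0, 0.0]

removed = dial_down(activation, direction, 0.0)
halved = dial_down(activation, direction, 0.5)
print(removed, halved)
```

A scale above 1.0 would amplify the concept instead, which corresponds to tuning a system to favor a topic.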
