Best in Technology
OpenAI teases its ‘breakthrough’ next-generation o3 reasoning model

By News Room | 21 December 2024 | 3 min read

For the finale of its 12 Days of OpenAI livestream event, OpenAI CEO Sam Altman revealed the company’s next foundation models, the successors to the recently announced o1 family of reasoning AIs, dubbed o3 and o3-mini.

And no, you aren’t going crazy: OpenAI skipped right over o2, apparently to avoid infringing on the trademark of British telecom provider O2.

While the new o3 models are not being released to the public just yet and there’s no word on when they’ll be incorporated into ChatGPT, they are now available for testing by safety and security researchers.

o3, our latest reasoning model, is a breakthrough, with a step function improvement on our hardest benchmarks. we are starting safety testing & red teaming now. https://t.co/4XlK1iHxFK

— Greg Brockman (@gdb) December 20, 2024

The o3 family, like the o1 family before it, operates differently from traditional generative models in that it internally fact-checks its responses before presenting them to the user. While this technique slows the model’s response time by anywhere from a few seconds to a few minutes, its answers to complex science, math, and coding queries tend to be more accurate and reliable than what you’d get from GPT-4. The model can also explain the reasoning by which it arrived at its result.

Users can also manually adjust the amount of time the model spends considering a problem by selecting between low, medium, and high compute, with the highest setting returning the most complete answers. That performance does not come cheap, mind you. Processing at high compute will reportedly cost thousands of dollars per task, ARC-AGI co-creator François Chollet wrote in an X post Friday.

Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks.

It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task… pic.twitter.com/ESQ9CNVCEA

— François Chollet (@fchollet) December 20, 2024

The new family of reasoning models reportedly offers significantly improved performance over even o1, which debuted in September, on the industry’s most challenging benchmark tests. According to the company, o3 outperforms its predecessor by nearly 23 percentage points on the SWE-Bench Verified coding test and scores more than 60 points higher than o1 on the Codeforces benchmark. The new model also scored an impressive 96.7% on the AIME 2024 mathematics test, missing just one question, and outperformed human experts on the GPQA Diamond, notching a score of 87.7%. Even more impressive, o3 reportedly solved more than a quarter of the problems presented on Epoch AI’s FrontierMath benchmark, where other models have struggled to correctly solve more than 2% of them.

OpenAI does note that the models it previewed on Friday are still early versions and that “final results may evolve with more post-training.” The company has additionally incorporated new “deliberative alignment” safety measures into o3’s training methodology. The o1 reasoning model has shown a troubling habit of trying to deceive human evaluators at a higher rate than conventional AIs like GPT-4o, Gemini, or Claude; OpenAI believes that the new guardrails will help minimize those tendencies in o3.

Members of the research community interested in trying o3-mini for themselves can sign up for access on OpenAI’s waitlist.











© 2025 Best in Technology. All Rights Reserved.
