Close Menu
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
Amazon’s rumored AI phone might be dead on arrival, says analyst

Amazon’s rumored AI phone might be dead on arrival, says analyst

22 March 2026
You Asked: What is QLED+? Can a Mini LED TV be edge lit?

You Asked: What is QLED+? Can a Mini LED TV be edge lit?

22 March 2026
Chrome on iPhone is putting Gemini front and center in your browsing

Chrome on iPhone is putting Gemini front and center in your browsing

22 March 2026
Facebook X (Twitter) Instagram
Just In
  • Amazon’s rumored AI phone might be dead on arrival, says analyst
  • You Asked: What is QLED+? Can a Mini LED TV be edge lit?
  • Chrome on iPhone is putting Gemini front and center in your browsing
  • Samsung’s next mid-range Galaxy A57 and Galaxy A37 finally get a launch date
  • The Round of 64: AI-ok 
  • The Best Subscription-Free Home Security Cameras I’ve Tried
  • Google Translate is getting a pronunciation coach to fix your awkward accent
  • Give Your Phone a Huge (and Free) Upgrade by Switching to Another Keyboard
Facebook X (Twitter) Instagram Pinterest Vimeo
Best in TechnologyBest in Technology
  • News
  • Phones
  • Laptops
  • Gadgets
  • Gaming
  • AI
  • Tips
  • More
    • Web Stories
    • Global
    • Press Release
Subscribe
Best in TechnologyBest in Technology
Home » OpenAI shrinks GPT-5.4 for speed and lower costs
News

OpenAI shrinks GPT-5.4 for speed and lower costs

News RoomBy News Room18 March 20263 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
OpenAI shrinks GPT-5.4 for speed and lower costs
Share
Facebook Twitter LinkedIn Pinterest Email

OpenAI is scaling its latest models down to hit a different target, faster responses and much lower costs. The new GPT-5.4 mini and nano are built for developers who care more about responsiveness than squeezing out every last bit of reasoning power.

Both models are available starting today. GPT-5.4 mini runs more than twice as fast as its predecessor while staying close to the full GPT-5.4 on key benchmarks. GPT-5.4 nano takes that further, focusing on simpler tasks like classification and data extraction where efficiency matters most.

This approach fits apps where speed shapes the experience. Coding assistants, background agents, and real-time vision tools depend on quick feedback, and in those cases a slightly smaller model often delivers a better overall result.

How much performance you actually lose

The performance gap between models is narrower than you might expect. GPT-5.4 mini scores 54.4 percent on SWE-Bench Pro, compared to 57.7 percent for the full model. On OSWorld-Verified, the mini reaches 72.1 percent while the larger version hits 75 percent, keeping the difference tight across tasks.

Costs drop far more dramatically. GPT-5.4 mini is priced at $0.75 per million input tokens and $4.50 per million output tokens, while nano comes in at $0.20 and $1.25. Both models support text and image inputs, tool use, function calling, and a 400,000 token context window, so the lower price doesn’t strip away core capabilities.

In Codex, the mini model uses just 30 percent of the GPT-5.4 quota. That lets developers shift routine coding work to a cheaper tier while saving the full model for harder reasoning.

When smaller models do the heavy lifting

OpenAI is also pushing a multi-model workflow. Instead of relying on one system, developers can split work across tiers, pairing a larger model for planning with smaller ones handling execution.

That setup reflects how many real apps already behave. One model can review a codebase or decide on changes, while another processes supporting data or repetitive steps. The smaller model handles the predictable work, while the larger one focuses on judgment and coordination.

Computer, Computer Hardware, Computer Keyboard

Early feedback suggests this mix is effective. Hebbia CTO Aabhas Sharma reported that GPT-5.4 mini matched or outperformed competing models on several tasks at a lower cost, and in some cases even delivered stronger end-to-end results than the full GPT-5.4.

What to use and when

GPT-5.4 mini is now available across the API, Codex, and ChatGPT. Free and Go users can access it through the Thinking option, while other users may see it as a fallback when they hit limits on GPT-5.4 Thinking.

The nano model is currently limited to the API, aimed at teams running high-volume workloads where cost control is critical. Both models are live today with full documentation available.

For developers building real-time AI features, the shift is clear. Smaller models are now capable enough to handle a larger share of everyday work, which makes choosing the right balance of speed, cost, and capability an increasingly practical decision.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleThe 4 Best Monitor Arms
Next Article Boroux Versus Rorra Countertop Water Filters, Tested Head to Head

Related Articles

Amazon’s rumored AI phone might be dead on arrival, says analyst
News

Amazon’s rumored AI phone might be dead on arrival, says analyst

22 March 2026
You Asked: What is QLED+? Can a Mini LED TV be edge lit?
News

You Asked: What is QLED+? Can a Mini LED TV be edge lit?

22 March 2026
Chrome on iPhone is putting Gemini front and center in your browsing
News

Chrome on iPhone is putting Gemini front and center in your browsing

22 March 2026
Samsung’s next mid-range Galaxy A57 and Galaxy A37 finally get a launch date
News

Samsung’s next mid-range Galaxy A57 and Galaxy A37 finally get a launch date

22 March 2026
The Round of 64: AI-ok 
News

The Round of 64: AI-ok 

22 March 2026
The Best Subscription-Free Home Security Cameras I’ve Tried
News

The Best Subscription-Free Home Security Cameras I’ve Tried

22 March 2026
Demo
Top Articles
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024130 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024111 Views
Costco partners with Electric Era to bring back EV charging in the U.S.

Costco partners with Electric Era to bring back EV charging in the U.S.

28 October 2024100 Views

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Latest News
The Best Subscription-Free Home Security Cameras I’ve Tried News

The Best Subscription-Free Home Security Cameras I’ve Tried

News Room22 March 2026
Google Translate is getting a pronunciation coach to fix your awkward accent News

Google Translate is getting a pronunciation coach to fix your awkward accent

News Room22 March 2026
Give Your Phone a Huge (and Free) Upgrade by Switching to Another Keyboard News

Give Your Phone a Huge (and Free) Upgrade by Switching to Another Keyboard

News Room22 March 2026
Most Popular
The Spectacular Burnout of a Solar Panel Salesman

The Spectacular Burnout of a Solar Panel Salesman

13 January 2025137 Views
5 laptops to buy instead of the M4 MacBook Pro

5 laptops to buy instead of the M4 MacBook Pro

17 November 2024130 Views
ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

ChatGPT o1 vs. o1-mini vs. 4o: Which should you use?

15 December 2024111 Views
Our Picks
Samsung’s next mid-range Galaxy A57 and Galaxy A37 finally get a launch date

Samsung’s next mid-range Galaxy A57 and Galaxy A37 finally get a launch date

22 March 2026
The Round of 64: AI-ok 

The Round of 64: AI-ok 

22 March 2026
The Best Subscription-Free Home Security Cameras I’ve Tried

The Best Subscription-Free Home Security Cameras I’ve Tried

22 March 2026

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact Us
© 2026 Best in Technology. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.