Sunday, 8 Feb 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > OpenAI announces 80% price drop for o3, it’s most powerful reasoning model
AI

OpenAI announces 80% price drop for o3, it’s most powerful reasoning model

Last updated: June 10, 2025 6:34 pm
Published June 10, 2025
Share
OpenAI announces 80% price drop for o3, it's most powerful reasoning model
SHARE

Be part of the occasion trusted by enterprise leaders for practically twenty years. VB Remodel brings collectively the individuals constructing actual enterprise AI technique. Learn more


Excellent news, AI builders!

OpenAI has introduced a substantial price cut on o3, its flagship reasoning giant language mannequin (LMM), slashing prices by a whopping 80% for each enter and output tokens.

(Recall tokens are the person numeric strings that LLMs use to signify phrases, phrases, mathematical and coding strings, and different content material. They’re representations of the semantic constructions the mannequin has realized via coaching, and in essence, are the LLMs’ native language. Most LLM suppliers supply their fashions via utility programming interfaces or APIs that builders can construct apps atop of or plug their exterior apps into, and most LLM suppliers cost them for the privilege at a price per million tokens).

The replace positions the mannequin as a extra accessible possibility for builders searching for superior reasoning capabilities, and locations OpenAI in additional direct pricing competitors with rival fashions similar to Gemini 2.5 Professional from Google DeepMind, Claude Opus 4 from Anthropic, and DeepSeek’s reasoning suite.

Introduced by Altman himself on X

Sam Altman, CEO of OpenAI, confirmed the change in a post on X highlighting that the brand new pricing is meant to encourage broader experimentation, writing: “we dropped the worth of o3 by 80%!! excited to see what individuals will do with it now. suppose you’ll even be proud of o3-pro pricing for the efficiency :)”

The price of utilizing o3 is now $2 per million enter tokens and $8 per million output tokens, with an additional low cost of $0.50 per million tokens when the person enters info that has been “cached,” or is saved and equivalent to what they offered earlier than.

See also  Intuit, Uber, and State Farm trial enterprise AI agents

This marks a major discount from the earlier charges of $10 (enter) and $40 (output), as OpenAI researcher Noam Brown identified on X.

Ray Fernando, a developer and early adopter, celebrated the pricing drop in a publish writing “LFG!” brief for “let’s fucking go!”

The sentiment displays a rising enthusiasm amongst builders trying to scale their initiatives with out prohibitive mannequin entry prices.

Value comparability to different rival reasoning LLMs

The worth adjustment comes at a time when AI suppliers are competing extra aggressively on each efficiency and affordability. A comparability with different main AI reasoning fashions illustrates how important this transfer may very well be:

  • Gemini 2.5 Professional Preview, developed by Google DeepMind, prices between $1.25 and $2.50 for enter relying on immediate measurement, and $10 to $15 for output. Whereas its integration with Google Search affords extra performance, that service carries its personal value — free for the primary 1,500 requests per day, then $35 per thousand requests.
  • Claude Opus 4, marketed by Anthropic as a mannequin optimized for complicated duties, is the most costly of the group, charging $15 per million enter tokens and $75 for output. Immediate caching learn and write providers come at $1.50 and $18.75 respectively, though customers can unlock a 50% low cost with batch processing.
  • DeepSeek’s fashions, notably DeepSeek-Reasoner and DeepSeek-Chat, undercut a lot of the market with aggressive low pricing. Enter tokens vary from $0.07 to $0.55 relying on caching and time of day, whereas output ranges from $1.10 to $2.19. Discounted charges throughout off-peak hours deliver costs down even additional, to as little as $0.035 for cached inputs.
See also  OpenAI makes GPT-4 Turbo with Vision API generally available
MannequinEnterCached EnterOutputLow cost Notes
OpenAI o3$2.00 (down from $10.00)$0.50$8.00 (down from $40.00)Flex Processing: $5 / $20
Gemini 2.5 Professional$1.25 – $2.50$0.31 – $0.625$10.00 – $15.00Increased fee applies to prompts >200k tokens
Claude Opus 4$15.00$1.50 (learn) / $18.75 (write)$75.0050% off with batch processing
DeepSeek-Chat$0.07 (hit)$0.27 (miss)—$1.1050% off throughout off-peak hours
DeepSeek-Reasoner$0.14 (hit)$0.55 (miss)—$2.1975% off throughout off-peak hours

As well as, unbiased third-party AI mannequin comparability and analysis group Synthetic Evaluation ran the brand new o3 via its suite of benchmarking assessments on numerous duties, and located it value $390 to finish all of them, versus $971 for Gemini 2.5 Professional and $342 for Claude 4 Sonnet.

Narrowing the associated fee vs. intelligence hole for builders

OpenAI’s pricing transfer not solely narrows the hole with ultra-low-cost fashions like DeepSeek but additionally places downward strain on higher-priced choices like Claude Opus and Gemini Professional.

In contrast to Claude or Gemini, OpenAI’s o3 additionally now affords a flex mode for synchronous processing that prices $5 for enter and $20 for output per million tokens, giving builders extra management over compute value and latency relying on workload kind.

o3 is at present out there via the OpenAI API and Playground. Customers with balances as little as just a few {dollars} can now discover the mannequin’s full capabilities, enabling prototyping and deployment with fewer monetary limitations.

This might notably profit startups, analysis groups, and particular person builders who beforehand discovered higher-tier mannequin entry cost-prohibitive.

See also  Samsung benchmarks real productivity of enterprise AI models

By considerably decreasing the price of its most superior reasoning mannequin, OpenAI is signaling a broader pattern within the generative AI area: premium efficiency is shortly turning into extra inexpensive, and builders now have a rising variety of viable, economically scalable choices.


Source link
TAGGED: announces, Drop, Model, OpenAI, powerful, price, reasoning
Share This Article
Twitter Email Copy Link Print
Previous Article AI Data Centers Have a Weight Problem AI Data Centers Have a Weight Problem
Next Article TSMC Evaluates Building Advanced Chip Plant in the UAE TSMC Evaluates Building Advanced Chip Plant in the UAE
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

How to log out of a Linux system from a script

If you happen to run a script utilizing a command like exec myscript and the…

July 30, 2024

Net-Zero AI Data Center Project Gets a $5B Boost in Saudi Arabia

Saudi Arabia-based knowledge middle developer DataVolt is investing $5 billion in a net-zero AI knowledge…

February 14, 2025

3% IT budget increases fueled by AI, security, networking

How they plan to spend their IT greenback can also be of notice, with respondents…

September 15, 2024

How cloud providers are tackling GPU shortages with custom chips

GPUs are the spine of AI computing, however as demand exceeds provide, cloud suppliers are…

December 6, 2024

TensorStax Raises $5M in Seed Funding

TensorStax, a San Francisco, CA-based supplier of an agentic platform for knowledge engineering, raised $5M…

May 12, 2025

You Might Also Like

SuperCool review: Evaluating the reality of autonomous creation
AI

SuperCool review: Evaluating the reality of autonomous creation

By saad
Top 7 best AI penetration testing companies in 2026
AI

Top 7 best AI penetration testing companies in 2026

By saad
Intuit, Uber, and State Farm trial AI agents inside enterprise workflows
AI

Intuit, Uber, and State Farm trial enterprise AI agents

By saad
Riello UPS announces new M2X modular power system
Design

Riello UPS announces new M2X modular power system

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.