Global Market

Nvidia claims 10x cost savings with open-source inference models

Last updated: February 15, 2026 8:35 am
Published February 15, 2026

Nvidia noted that cost per token fell from 20 cents on the older Hopper platform to 10 cents on Blackwell. Moving to Blackwell’s native low-precision NVFP4 format further reduced the cost to just 5 cents, so a basic upgrade delivered a 4x improvement in cost per token while maintaining the accuracy that customers expect.
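The arithmetic behind the 4x figure can be checked with a short sketch. The function and variable names below are illustrative assumptions, not anything published by Nvidia; only the three cost figures come from the article.

```python
def improvement_factor(old_cost: float, new_cost: float) -> float:
    """Return how many times cheaper new_cost is compared with old_cost."""
    if new_cost <= 0:
        raise ValueError("new_cost must be positive")
    return old_cost / new_cost


# Cost figures quoted in the article, in cents (hypothetical variable names).
hopper_cents = 20.0
blackwell_cents = 10.0
blackwell_nvfp4_cents = 5.0

# Gain from the hardware move alone, then from switching to NVFP4.
hardware_gain = improvement_factor(hopper_cents, blackwell_cents)
format_gain = improvement_factor(blackwell_cents, blackwell_nvfp4_cents)

# The two steps multiply to give the overall improvement.
total_gain = improvement_factor(hopper_cents, blackwell_nvfp4_cents)

print(f"Hopper to Blackwell: {hardware_gain:.0f}x")
print(f"Blackwell to Blackwell with NVFP4: {format_gain:.0f}x")
print(f"Overall: {total_gain:.0f}x")
```

Each step halves the cost, so the two 2x gains compound to the 4x total.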

Nvidia outlined four business deployments in a blog post showing how this combination of Blackwell infrastructure, NVFP4, optimized software stacks and open-source models delivers significant cost reductions. They break down like this:

  • Healthcare: In healthcare, tedious, time-consuming tasks like medical coding, documentation and managing insurance forms cut into the time doctors can spend with patients. Sully.ai addresses this problem by using AI agents to handle those routine tasks.

The problem is that Sully.ai’s proprietary, closed-source models didn’t scale well. So Sully.ai used Baseten’s open-source Model API on Blackwell GPUs with the NVFP4 data format, the TensorRT-LLM library and the Dynamo inference framework. The result was a 90% drop in inference costs, a 10x reduction compared with the prior closed-source implementation, while response times improved by 65% for critical workflows like generating medical notes.
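A percentage drop and an "Nx" reduction are two ways of stating the same change. The sketch below shows the conversion; the helper name is hypothetical, and the only input taken from the article is the 90% figure.

```python
def reduction_factor(percent_drop: float) -> float:
    """Convert a percentage cost drop into an 'Nx cheaper' multiplier."""
    if not 0 <= percent_drop < 100:
        raise ValueError("percent_drop must be in the range [0, 100)")
    # If costs fall by p percent, (100 - p) percent of the old cost remains,
    # so the old cost is 100 / (100 - p) times the new cost.
    return 100.0 / (100.0 - percent_drop)


# The 90% drop in inference costs reported for Sully.ai.
factor = reduction_factor(90.0)
print(f"A 90% drop is a {factor:.0f}x reduction")
```

By the same conversion, a 50% drop would be a 2x reduction, which is why the last few percentage points matter so much for the headline multiplier.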
