Data Center News
Global Market

Nvidia claims 10x cost savings with open-source inference models

Last updated: February 15, 2026 8:35 am
Published February 15, 2026

Nvidia noted that cost per token went from 20 cents on the older Hopper platform to 10 cents on Blackwell. Moving to Blackwell's native low-precision NVFP4 format further reduced the cost to just 5 cents, so the upgrade delivered a 4x improvement in cost per token while maintaining the accuracy that customers expect.
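The arithmetic behind the 4x figure can be checked with a short sketch. The per-token costs are the figures quoted above; the function name `improvement_factor` and the dictionary keys are illustrative assumptions, not anything published by Nvidia.

```python
# Cost per token in cents, as quoted in the article.
# The key names are illustrative labels, not Nvidia terminology.
COST_CENTS = {
    "hopper": 20.0,
    "blackwell": 10.0,
    "blackwell_nvfp4": 5.0,
}


def improvement_factor(old_cost: float, new_cost: float) -> float:
    """Return how many times cheaper new_cost is than old_cost."""
    if new_cost <= 0:
        raise ValueError("new_cost must be positive")
    return old_cost / new_cost


# Gain from the hardware change alone (Hopper to Blackwell).
hw_gain = improvement_factor(COST_CENTS["hopper"], COST_CENTS["blackwell"])
# Gain from switching to NVFP4 on the same Blackwell hardware.
fmt_gain = improvement_factor(COST_CENTS["blackwell"], COST_CENTS["blackwell_nvfp4"])
# Combined gain from the oldest to the newest configuration.
total_gain = improvement_factor(COST_CENTS["hopper"], COST_CENTS["blackwell_nvfp4"])

print(hw_gain, fmt_gain, total_gain)  # 2.0 2.0 4.0
```

The two 2x steps multiply rather than add, which is why halving the cost twice gives a 4x improvement overall.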

Nvidia outlined four business deployments in a blog post showing how this combination of Blackwell infrastructure, NVFP4, optimized software stacks and open-source models delivers significant cost reductions. They break down like this:

  • Healthcare: Tedious, time-consuming tasks like medical coding, documentation and managing insurance forms cut into the time doctors can spend with patients. Sully.ai helps address this problem by using AI agents to handle those routine tasks.

The problem is that Sully.ai's proprietary, closed-source models didn't scale well. So Sully.ai used Baseten's open-source Model API on Blackwell GPUs with the NVFP4 data format, the TensorRT-LLM library and the Dynamo inference framework. The result was a 90% drop in inference costs, representing a 10x reduction compared with the prior closed-source implementation, while response times improved by 65% for critical workflows like generating medical notes.
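A percentage drop and an "Nx reduction" are two ways of stating the same change, and the conversion is easy to get wrong. The following is a minimal sketch of that conversion; the function name `reduction_to_factor` is a hypothetical helper chosen for illustration.

```python
def reduction_to_factor(percent_drop: float) -> float:
    """Convert a percentage cost drop into an 'N times cheaper' factor.

    A drop of p percent leaves (100 - p) percent of the original cost,
    so the original is 100 / (100 - p) times the new cost.
    """
    if not 0 <= percent_drop < 100:
        raise ValueError("percent_drop must be in the range [0, 100)")
    return 100.0 / (100.0 - percent_drop)


# The 90% drop reported for the healthcare deployment.
factor_90 = reduction_to_factor(90)
print(factor_90)  # 10.0

# For comparison: smaller drops map to much smaller factors.
print(reduction_to_factor(50))  # 2.0
print(reduction_to_factor(75))  # 4.0
```

The relationship is nonlinear: moving from a 75% drop to a 90% drop raises the factor from 4x to 10x.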
