Tuesday, 10 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Global Market > Nvidia claims 10x cost savings with open-source inference models
Global Market

Nvidia claims 10x cost savings with open-source inference models

Last updated: February 15, 2026 8:35 am
Published February 15, 2026
Share
Big data technology and data science illustration. Data flow concept. Querying, analysing, visualizing complex information. Neural network for artificial intelligence. Data mining. Business analytics.
SHARE

Nvidia famous that price per token went from 20 cents on the older Hopper platform to 10 cents on Blackwell. Transferring to Blackwell’s native low-precision NVFP4 format additional diminished the price to only 5 cents, so a fundamental improve gave a 4x enchancment in price per token whereas sustaining the accuracy that clients anticipate.

Nvidia outlined 4 business deployments in a blog post displaying how this mix of Blackwell infrastructure, NVFP4, optimized software program stacks and open-source fashions delivers vital price reductions. They break down like this:

  • Healthcare — In healthcare, tedious, time-consuming duties like medical coding, documentation and managing insurance coverage kinds minimize into the time docs can spend with sufferers. Sully.ai helps deal with this downside by AI brokers to deal with routine duties that take up time.

The issue is that Sully.ai’s proprietary, closed supply fashions didn’t scale effectively. So Sully.ai used Baseten’s open-source Mannequin API on Blackwell GPUs with NVFP4 knowledge format, the TensorRT-LLM library and the Dynamo inference framework .The consequence was a 90% drop in inference prices dropped by 90%, representing a 10x discount in contrast with the prior closed supply implementation, whereas response instances improved by 65% for crucial workflows like producing medical notes.

Source link

See also  Kickstart Europe 2025 - HostingJournalist.com
TAGGED: 10x, Claims, Cost, Inference, models, Nvidia, opensource, savings
Share This Article
Twitter Email Copy Link Print
Previous Article artificial intelligence AI hands conceptual Arista laments ‘horrendous’ memory situation
Next Article data-center-control-it-specialists-network-monitoring IT bonuses reward network, security skills that can’t be automated
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Pepeto’s $600K Presale Highlights Vision for Supporting Memecoins Ahead of 2025

London, uk, November twenty second, 2024, Chainwire The memecoin market is evolving quickly, with Pepeto…

November 22, 2024

How the EPIQC project is empowering the quantum computing revolution

The EPIQC project helps to take the quantum world forwards, each when it comes to…

February 17, 2024

Nebius to triple capacity at Finland data centre to 75 MW

The enlargement of the Finnish information middle – a state-of-the-art location with robust inexperienced credentials…

October 8, 2024

Dream Exchange Receives Investment from Harvey Catchings

Dream Exchange, a Chicago, IL-based minority-controlled inventory trade in america, acquired an funding from Harvey Catchings.…

May 24, 2024

Pepeto and Pepe Unchained Introduce zero fee trading and cross chain solutions vs layer 2 tech

London, uk, November twenty ninth, 2024, Chainwire   As Bitcoin edges nearer to the $100K…

November 29, 2024

You Might Also Like

Nscale lands $2bn funding as former Meta bigwigs join board
Global Market

Nscale lands $2bn funding as former Meta bigwigs join board

By saad
Verne appoints former NTT exec Wayne Louw as COO
Global Market

Verne appoints former NTT exec Wayne Louw as COO

By saad
Data centres don’t have to come at the expense of urban growth
Global Market

Data centres don’t have to come at the expense of urban growth

By saad
World map dots on blue background
Global Market

Digital sovereignty options for on-prem deployments

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.