Monday, 30 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Global Market > Nvidia claims 10x cost savings with open-source inference models
Global Market

Nvidia claims 10x cost savings with open-source inference models

Last updated: February 15, 2026 8:35 am
Published February 15, 2026
Share
Big data technology and data science illustration. Data flow concept. Querying, analysing, visualizing complex information. Neural network for artificial intelligence. Data mining. Business analytics.
SHARE

Nvidia famous that price per token went from 20 cents on the older Hopper platform to 10 cents on Blackwell. Transferring to Blackwell’s native low-precision NVFP4 format additional diminished the price to only 5 cents, so a fundamental improve gave a 4x enchancment in price per token whereas sustaining the accuracy that clients anticipate.

Nvidia outlined 4 business deployments in a blog post displaying how this mix of Blackwell infrastructure, NVFP4, optimized software program stacks and open-source fashions delivers vital price reductions. They break down like this:

  • Healthcare — In healthcare, tedious, time-consuming duties like medical coding, documentation and managing insurance coverage kinds minimize into the time docs can spend with sufferers. Sully.ai helps deal with this downside by AI brokers to deal with routine duties that take up time.

The issue is that Sully.ai’s proprietary, closed supply fashions didn’t scale effectively. So Sully.ai used Baseten’s open-source Mannequin API on Blackwell GPUs with NVFP4 knowledge format, the TensorRT-LLM library and the Dynamo inference framework .The consequence was a 90% drop in inference prices dropped by 90%, representing a 10x discount in contrast with the prior closed supply implementation, whereas response instances improved by 65% for crucial workflows like producing medical notes.

Source link

See also  Do AI reasoning models require new approaches to prompting?
TAGGED: 10x, Claims, Cost, Inference, models, Nvidia, opensource, savings
Share This Article
Twitter Email Copy Link Print
Previous Article artificial intelligence AI hands conceptual Arista laments ‘horrendous’ memory situation
Next Article data-center-control-it-specialists-network-monitoring IT bonuses reward network, security skills that can’t be automated
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

EIT-based tactile sensor provides new approach to fine motor skills assessment

Researchers from SIT Japan confirmed a peg-shaped sensor for classifying grownup pinching motions. Reconstructed photographs…

May 22, 2024

Running serverless .NET applications in AWS Lambda

The recent evolution of .NET has been fun to watch. Since .NET became an open-source…

January 28, 2024

Tier IV unveils Edge.Auto to transform autonomous driving systems

Tier IV has launched Edge.Auto, a product that ranges from individual hardware components to fully…

January 22, 2024

Synadia secures $25 million funding for AI-powered multi-cloud and edge computing demands

Open supply cloud and edge-native messaging system Synadia Communications has closed a $25 million Sequence…

February 27, 2024

Online Panel: Facilitating a Multi-Provider Sovereign AI Cloud

As a part of the European Union's €3 billion IPCEI-CIS initiative, OpenNebula Methods hosts a…

May 9, 2025

You Might Also Like

Large AWS sign. Amazon Web Services (AWS) is a subsidiary of Amazon that provides on-demand cloud computing platforms - Las Vegas, Nevada, USA - December 3, 2019
Global Market

Amazon waives entire month’s AWS charges after Iranian drone attack

By saad
Day Two is the real stress test for AI infrastructure
Global Market

Day Two is the real stress test for AI infrastructure

By saad
air vs liquid cooling 1
Global Market

Why AI rack densities make liquid cooling nonnegotiable

By saad
Cisco building exterior with sign
Global Market

Chained vulnerabilities in Cisco Catalyst switches could induce denial-of-service

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.