Monday, 13 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Nvidia and DataStax just made generative AI smarter and leaner — here’s how
AI

Nvidia and DataStax just made generative AI smarter and leaner — here’s how

Last updated: December 26, 2024 5:08 pm
Published December 26, 2024
Share
Nvidia and DataStax just made generative AI smarter and leaner — here’s how
SHARE

Be part of our each day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Study Extra


Nvidia and DataStax launched new know-how immediately that dramatically reduces storage necessities for corporations deploying generative AI programs, whereas enabling quicker and extra correct data retrieval throughout a number of languages.

The brand new Nvidia NeMo Retriever microservices, built-in with DataStax’s AI platform, cuts knowledge storage quantity by 35 instances in comparison with conventional approaches — a vital functionality, as enterprise knowledge is projected to achieve more than 20 zettabytes by 2027.

“At this time’s enterprise unstructured knowledge is at 11 zettabytes, roughly equal to 800,000 copies of the Library of Congress, and 83% of that’s unstructured with 50% being audio and video,” stated Kari Briski, VP of product administration for AI at Nvidia, in an interview with VentureBeat. “Considerably lowering these storage prices whereas enabling corporations to successfully embed and retrieve data turns into a recreation changer.”

Nvidia’s NeMo Retriever know-how delivers a 35x enchancment in knowledge storage effectivity, as illustrated in a comparability of uncooked textual content storage, baseline vector embeddings, and lowered embedding dimensions. This breakthrough underpins the scalability of generative AI throughout enterprise functions. (Credit score: Nvidia)

The know-how is already proving transformative for Wikimedia Foundation, which used the built-in answer to scale back processing time for 10 million Wikipedia entries from 30 days to underneath three days. The system handles real-time updates throughout a whole bunch of hundreds of entries being edited each day by 24,000 international volunteers.

“You’ll be able to’t simply depend on massive language fashions for content material — you want context out of your present enterprise knowledge,” defined Chet Kapoor, CEO of DataStax. “That is the place our hybrid search functionality is available in, combining each semantic search and conventional textual content search, then utilizing Nvidia’s re-ranker know-how to ship essentially the most related leads to actual time at international scale.”

See also  Amazon strives to outpace Nvidia with cheaper, faster AI chips

Enterprise knowledge safety meets AI accessibility

The partnership addresses a vital problem going through enterprises: the way to make their huge shops of personal knowledge accessible to AI programs with out exposing delicate data to exterior language fashions.

“Take FedEx — 60% of their knowledge sits in our merchandise, together with all package deal supply data for the previous 20 years with private particulars. That’s not going to Gemini or OpenAI anytime quickly, or ever,” Kapoor defined.

The know-how is discovering early adoption throughout industries, with monetary companies companies main the cost regardless of regulatory constraints. “I’ve been blown away by how far forward monetary companies companies are actually,” stated Kapoor, citing Commonwealth Bank of Australia and Capital One as examples.

The subsequent frontier for AI: Multimodal doc processing

Wanting forward, Nvidia plans to broaden the know-how’s capabilities to deal with extra advanced doc codecs. “We’re seeing nice outcomes with multimodal PDF processing — understanding tables, graphs, charts and pictures and the way they relate throughout pages,” Briski revealed. “It’s a very onerous drawback that we’re excited to sort out.”

For enterprises drowning in unstructured knowledge whereas attempting to deploy AI responsibly, the brand new providing supplies a path to make their data belongings AI-ready with out compromising safety or breaking the financial institution on storage prices. The answer is on the market instantly via the Nvidia API catalog with a 90-day free trial license.

The announcement underscores the rising give attention to enterprise AI infrastructure as corporations transfer past experimentation to large-scale deployment, with knowledge administration and value effectivity turning into vital success elements.

See also  Test-driving Google's Gemini-Exp-1206 model in data analysis, visualizations

Source link
TAGGED: DataStax, generative, Heres, leaner, Nvidia, smarter
Share This Article
Twitter Email Copy Link Print
Previous Article Top 10 Data Center Security Stories of 2024 Top 10 Data Center Security Stories of 2024
Next Article In the Shadows of Arizona’s Data Center Boom, Thousands Live Without Power In the Shadows of Arizona’s Data Center Boom, Thousands Live Without Power
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

The FBI says Russian emails are sending fake bomb threats to polling stations

The Federal Bureau of Investigation has issued a warning that pretend bomb threats are being…

November 6, 2024

Fighting the skills shortage war in the right way

With expertise shortages intensifying, information centre operators ought to prioritise problem-solving and studying potential over…

November 9, 2024

The perfect certificate migration until it wasn’t: How certificates can break RADIUS trusts

Most significantly, including the basis certificates on the AOS change is famous as an automatic…

January 16, 2026

AWS launches in-line Q Developer AI coding assistant

Be a part of our day by day and weekly newsletters for the newest updates…

October 30, 2024

Key Trends and Technologies Impacting Data Centers in 2024 and Beyond | DCN

There are various developments and applied sciences affecting knowledge facilities on a world scale. These…

March 7, 2024

You Might Also Like

Nvidia GTC 2026 Vera Rubin
Global Market

Nvidia Rubin GPUs may be delayed, slowing the next phase of AI infrastructure

By saad
Did Meta Sacrifice Its Open-Source Identity for a Competitive AI Model?
AI

Did Meta Sacrifice Its Open-Source Identity for a Competitive AI Model?

By saad
How robust AI governance protects enterprise margins
AI

How robust AI governance protects enterprise margins

By saad
Why companies like Apple are building AI agents with limits
AI

Why companies like Apple are building AI agents with limits

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.