Saturday, 28 Feb 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Global Market > Perplexity’s open-source tool to run trillion-parameter models without costly upgrades
Global Market

Perplexity’s open-source tool to run trillion-parameter models without costly upgrades

Last updated: November 9, 2025 11:40 am
Published November 9, 2025
Share
LLMs, ChatGPT, Generative AI
SHARE

The apparent reply can be Nvidia’s new GB200 programs, primarily one big 72-GPU server. However these value thousands and thousands, face excessive provide shortages, and aren’t accessible all over the place, the researchers famous. In the meantime, H100 and H200 programs are plentiful and comparatively low-cost.

The catch: operating giant fashions throughout a number of older programs has historically meant brutal efficiency penalties. “There are not any viable cross-provider options for LLM inference,” the analysis staff wrote, noting that current libraries both lack AWS assist totally or endure extreme efficiency degradation on Amazon’s {hardware}.

TransferEngine goals to alter that. “TransferEngine allows transportable point-to-point communication for contemporary LLM architectures, avoiding vendor lock-in whereas complementing collective libraries for cloud-native deployments,” the researchers wrote.

How TransferEngine works

TransferEngine acts as a common translator for GPU-to-GPU communication, in accordance with the paper. It creates a standard interface that works throughout completely different networking {hardware} by figuring out the core performance shared by numerous programs.

TransferEngine makes use of RDMA (Distant Direct Reminiscence Entry) expertise. This enables computer systems to switch information immediately between graphics playing cards with out involving the primary processor—consider it as a devoted categorical lane between chips.

Perplexity’s implementation achieved 400 gigabits per second throughput on each Nvidia ConnectX-7 and AWS EFA, matching current single-platform options. TransferEngine additionally helps utilizing a number of community playing cards per GPU, aggregating bandwidth for even quicker communication.

Source link

See also  Open-source revolution: How DeepSeek-R1 challenges OpenAI's o1 with superior processing, cost efficiency
TAGGED: costly, models, opensource, Perplexitys, Run, tool, Trillionparameter, upgrades
Share This Article
Twitter Email Copy Link Print
Previous Article Quantifying AI ROI in strategy Quantifying AI ROI in strategy
Next Article NYU’s new AI architecture makes high-quality image generation faster and cheaper NYU’s new AI architecture makes high-quality image generation faster and cheaper
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Microsoft is quadrupling its AI investment in Spain

Microsoft has announced plans to considerably enhance its funding in AI and cloud infrastructure in…

February 21, 2024

AutoIVF Closes Funding Round

AutoIVF Inc., a Natick, MA-based fertility care firm, raised an undisclosed quantity in funding. The…

May 10, 2025

ZEDEDA deepens NVIDIA integration to streamline enterprise edge AI deployment

ZEDEDA has enhanced its integration with NVIDIA’s edge AI platform, together with assist for NVIDIA…

March 17, 2025

Hurricane Electric Expands Network in New Zealand With New PoP in Auckland

Information Vault Auckland gives a carrier-neutral, extremely safe surroundings with connectivity to main Web exchanges…

November 29, 2025

arch.law acquires leading data centre specialist law firm Conexus Law

Because it was established in 2020 Conexus has constructed a enterprise and worldwide functionality advising…

June 9, 2024

You Might Also Like

AI
Global Market

OpenAI launches stateful AI on AWS, signaling a control plane power shift

By saad
AI is rewriting the rules of data centre power – who wins?
Global Market

AI is rewriting the rules of data centre power – who wins?

By saad
Spotlight report: Accelerating Data Center Modernization
Global Market

Spotlight report: Accelerating Data Center Modernization

By saad
The next AI race may not be on Earth at all
Global Market

The next AI race may not be on Earth at all

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.