Friday, 20 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Global Market > Perplexity’s open-source tool to run trillion-parameter models without costly upgrades
Global Market

Perplexity’s open-source tool to run trillion-parameter models without costly upgrades

Last updated: November 9, 2025 11:40 am
Published November 9, 2025
Share
LLMs, ChatGPT, Generative AI
SHARE

The apparent reply can be Nvidia’s new GB200 programs, primarily one big 72-GPU server. However these value thousands and thousands, face excessive provide shortages, and aren’t accessible all over the place, the researchers famous. In the meantime, H100 and H200 programs are plentiful and comparatively low-cost.

The catch: operating giant fashions throughout a number of older programs has historically meant brutal efficiency penalties. “There are not any viable cross-provider options for LLM inference,” the analysis staff wrote, noting that current libraries both lack AWS assist totally or endure extreme efficiency degradation on Amazon’s {hardware}.

TransferEngine goals to alter that. “TransferEngine allows transportable point-to-point communication for contemporary LLM architectures, avoiding vendor lock-in whereas complementing collective libraries for cloud-native deployments,” the researchers wrote.

How TransferEngine works

TransferEngine acts as a common translator for GPU-to-GPU communication, in accordance with the paper. It creates a standard interface that works throughout completely different networking {hardware} by figuring out the core performance shared by numerous programs.

TransferEngine makes use of RDMA (Distant Direct Reminiscence Entry) expertise. This enables computer systems to switch information immediately between graphics playing cards with out involving the primary processor—consider it as a devoted categorical lane between chips.

Perplexity’s implementation achieved 400 gigabits per second throughput on each Nvidia ConnectX-7 and AWS EFA, matching current single-platform options. TransferEngine additionally helps utilizing a number of community playing cards per GPU, aggregating bandwidth for even quicker communication.

Source link

See also  Online tool helps streamline data centre planning and design
TAGGED: costly, models, opensource, Perplexitys, Run, tool, Trillionparameter, upgrades
Share This Article
Twitter Email Copy Link Print
Previous Article Quantifying AI ROI in strategy Quantifying AI ROI in strategy
Next Article NYU’s new AI architecture makes high-quality image generation faster and cheaper NYU’s new AI architecture makes high-quality image generation faster and cheaper
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Vittori Partners with Totum 3D and ShapeUp Studios for Titanium Additive Production

2100 Geng Street, United States, August 14th, 2025, FinanceWire Vittori introduced a manufacturing partnership with…

August 14, 2025

This new AI technique creates ‘digital twin’ consumers, and it could kill the traditional survey industry

A brand new research paper quietly revealed final week outlines a breakthrough methodology that enables…

October 13, 2025

Reddit is reportedly selling data for AI training

Reddit has negotiated a content material licensing deal to permit its information for use for…

February 19, 2024

When is journalism hacking? – The Verge

Some legal guidelines function like hidden lure doorways — everybody walks throughout the lure at one…

February 24, 2024

Echion Tecnologies Raises £29M in Series B Funding

Echion Tecnologies, a Sawston, UK-based developer of niobium-based, fast-charging battery supplies, raised £29M in Collection…

June 15, 2024

You Might Also Like

Cloud Computing Disaster Recovery Solutions Concept - Cloud DR - Services Companies Use for the Purpose of Backing Up Resources into a Cloud Environment - 3D Illustration
Global Market

Nile adds microsegmentation and native NAC to its secure NaaS platform

By saad
Planning delays continue to delay Tritax's Slough data centre
Global Market

Planning delays continue to delay Tritax’s Slough data centre

By saad
A photograph of a row of Ethernet cables plugged into ports, with a warning sign illuminated above one of the ports.
Global Market

Telnet vulnerability opens door to remote code execution as root

By saad
Could Telehouse be about to add a sixth data centre to its Docklands campus?
Global Market

Could Telehouse be about to add a sixth data centre to its Docklands campus?

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.