Friday, 10 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Global Market > Perplexity’s open-source tool to run trillion-parameter models without costly upgrades
Global Market

Perplexity’s open-source tool to run trillion-parameter models without costly upgrades

Last updated: November 9, 2025 11:40 am
Published November 9, 2025
Share
LLMs, ChatGPT, Generative AI
SHARE

The apparent reply can be Nvidia’s new GB200 programs, primarily one big 72-GPU server. However these value thousands and thousands, face excessive provide shortages, and aren’t accessible all over the place, the researchers famous. In the meantime, H100 and H200 programs are plentiful and comparatively low-cost.

The catch: operating giant fashions throughout a number of older programs has historically meant brutal efficiency penalties. “There are not any viable cross-provider options for LLM inference,” the analysis staff wrote, noting that current libraries both lack AWS assist totally or endure extreme efficiency degradation on Amazon’s {hardware}.

TransferEngine goals to alter that. “TransferEngine allows transportable point-to-point communication for contemporary LLM architectures, avoiding vendor lock-in whereas complementing collective libraries for cloud-native deployments,” the researchers wrote.

How TransferEngine works

TransferEngine acts as a common translator for GPU-to-GPU communication, in accordance with the paper. It creates a standard interface that works throughout completely different networking {hardware} by figuring out the core performance shared by numerous programs.

TransferEngine makes use of RDMA (Distant Direct Reminiscence Entry) expertise. This enables computer systems to switch information immediately between graphics playing cards with out involving the primary processor—consider it as a devoted categorical lane between chips.

Perplexity’s implementation achieved 400 gigabits per second throughput on each Nvidia ConnectX-7 and AWS EFA, matching current single-platform options. TransferEngine additionally helps utilizing a number of community playing cards per GPU, aggregating bandwidth for even quicker communication.

Source link

See also  AWS signs wind PPA with Avangrid in Oregon
TAGGED: costly, models, opensource, Perplexitys, Run, tool, Trillionparameter, upgrades
Share This Article
Twitter Email Copy Link Print
Previous Article Quantifying AI ROI in strategy Quantifying AI ROI in strategy
Next Article NYU’s new AI architecture makes high-quality image generation faster and cheaper NYU’s new AI architecture makes high-quality image generation faster and cheaper
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

AI in manufacturing set to unleash new era of profit

Manufacturing executives are wagering practically half their modernisation budgets on AI, betting these techniques will…

December 6, 2025

ASUS IoT leverages NVIDIA Jetson Orin to double GenAI performance for edge applications

ASUS IoT, edge AI computer systems now help “Supermode” for NVIDIA Jetson Orin NX and…

January 22, 2025

Bio-based fabric with integrated sensors continuously monitors asphalt road conditions

Industrial zone exams: As step one, the sensor cloth is put in throughout the complete…

October 4, 2025

Lessons Learned from a Data Center Fire

It’s 5am. You're properly tucked up in mattress. The cellphone rings. It’s a colleague explaining…

June 4, 2024

CoreNest Capital Invests in Texture Capital

NYC-based enterprise capital agency CoreNest Capital has introduced a strategic funding in Texture Capital Holdings, the father…

November 28, 2024

You Might Also Like

Cloud Hyperscaler Concept - Hyperscale Computing - Cloud Architecture that Scales with Increasing Demand - 3D Illustration
Global Market

Neoclouds gain momentum in a supply-constrained world

By saad
Stargate comes to the UK, with OpenAI, Nvidia and Nscale
Global Market

OpenAI puts Stargate UK on pause, cites ‘high energy costs’

By saad
open source digital screen
Global Market

New v2 UALink specification aims to catch up to NVLink

By saad
Could being a ‘good neighbour’ help secure grid access?
Global Market

Could being a ‘good neighbour’ help secure grid access?

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.