Wednesday, 10 Dec 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Global Market > Perplexity’s open-source tool to run trillion-parameter models without costly upgrades
Global Market

Perplexity’s open-source tool to run trillion-parameter models without costly upgrades

Last updated: November 9, 2025 11:40 am
Published November 9, 2025
Share
LLMs, ChatGPT, Generative AI
SHARE

The apparent reply can be Nvidia’s new GB200 programs, primarily one big 72-GPU server. However these value thousands and thousands, face excessive provide shortages, and aren’t accessible all over the place, the researchers famous. In the meantime, H100 and H200 programs are plentiful and comparatively low-cost.

The catch: operating giant fashions throughout a number of older programs has historically meant brutal efficiency penalties. “There are not any viable cross-provider options for LLM inference,” the analysis staff wrote, noting that current libraries both lack AWS assist totally or endure extreme efficiency degradation on Amazon’s {hardware}.

TransferEngine goals to alter that. “TransferEngine allows transportable point-to-point communication for contemporary LLM architectures, avoiding vendor lock-in whereas complementing collective libraries for cloud-native deployments,” the researchers wrote.

How TransferEngine works

TransferEngine acts as a common translator for GPU-to-GPU communication, in accordance with the paper. It creates a standard interface that works throughout completely different networking {hardware} by figuring out the core performance shared by numerous programs.

TransferEngine makes use of RDMA (Distant Direct Reminiscence Entry) expertise. This enables computer systems to switch information immediately between graphics playing cards with out involving the primary processor—consider it as a devoted categorical lane between chips.

Perplexity’s implementation achieved 400 gigabits per second throughput on each Nvidia ConnectX-7 and AWS EFA, matching current single-platform options. TransferEngine additionally helps utilizing a number of community playing cards per GPU, aggregating bandwidth for even quicker communication.

Source link

See also  Salesforce proves less is more: xLAM-1B 'Tiny Giant' beats bigger AI Models
TAGGED: costly, models, opensource, Perplexitys, Run, tool, Trillionparameter, upgrades
Share This Article
Twitter Email Copy Link Print
Previous Article Quantifying AI ROI in strategy Quantifying AI ROI in strategy
Next Article NYU’s new AI architecture makes high-quality image generation faster and cheaper NYU’s new AI architecture makes high-quality image generation faster and cheaper
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Alibaba Cloud unleashes over 100 open-source AI models

Alibaba Cloud has open-sourced greater than 100 of its newly-launched AI fashions, collectively often called…

September 20, 2024

Lindis Blood Care Raises Financing

Lindis Blood Care, a Hennigsdorf, Germany-based medical machine firm, raised an undisclosed quantity in funding.…

December 14, 2024

Kyndryl, AWS unwrap AI-driven mainframe migration service

Kyndryl (NYSE:KD) is tapping into agentic AI expertise from AWS to supply enterprise prospects one…

June 10, 2025

Codestone Acquires Cloud Business

Codestone, a Poole, UK-based enterprise useful resource planning (ERP) and cloud database applied sciences firm,…

February 18, 2024

IOTech to offer enhanced usability for open edge solutions via Edge Central 3.1 release

IOTech, an open edge computing answer supplier, has introduced the overall availability of Edge Central…

March 29, 2024

You Might Also Like

Faster field interventions with label printers built for mobility
Global Market

Faster field interventions with label printers built for mobility

By saad
Data center infrastructure with interconnected servers, cloud computing, and virtual networks. Vector isometric illustration for advanced IT systems, big data, and cloud storage.
Global Market

Short memory supply forces Micron to abandon consumer market, prioritize enterprise

By saad
Siemens, nVent develop blueprint for NVIDIA AI data centres
Global Market

Siemens, nVent develop blueprint for NVIDIA AI data centres

By saad
Nvidia high-performance chip technology
Global Market

US approves Nvidia H200 exports to China, raising questions about enterprise GPU supply

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.