Wednesday, 25 Feb 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Global Market > Red Hat Execs on Inference Scaling and Open-Source AI
Global Market

Red Hat Execs on Inference Scaling and Open-Source AI

Last updated: July 20, 2025 8:45 am
Published July 20, 2025
Share
Red Hat Execs on Inference Scaling and Open-Source AI
SHARE

This week on ‘No Math AI,’ Akash and Isha go to the Pink Hat Summit to fulfill with Pink Hat CEO Matt Hicks and CTO Chris Wright. They speak concerning the sensible necessities of introducing inference time scaling, also referred to as take a look at time scaling/compute, to company customers across the globe.

Matt Hicks examines how an AI platform is important for abstracting complexity and bearing bills as AI evolves from static fashions to dynamic, agentic purposes. These purposes make in depth use of inference time scaling strategies, together with particle filtering and reasoning, which produce numerous tokens so as to enhance accuracy. Hicks highlights the need of platforms that cut back the unit price of those capabilities, make it easy for companies to implement such methods, and encourage confidence by providing price transparency so as to recover from the “concern response” associated to unexpected prices when doing extra inferencing.

Chris Wright describes the open-source AI roadmap for implementing these novel, inference-based applied sciences in manufacturing in a dependable method. Shifting from single-instance inference to a distributed infrastructure that may help a number of customers directly and successfully handle the large token manufacturing wanted by these scaled inference processes is likely one of the difficulties he addresses. Wright presents the brand new Pink Hat undertaking LLM-d, which goals to offer a distributed inference platform commonplace. By way of integration with Kubernetes, LLM-d seeks to enhance {hardware} utilization, handle distributed KV caches, and intelligently route requests relying on {hardware} wants. By way of cooperative open-source initiatives, the target is to offer repeatable blueprints for a shared structure to handle these inference-time-scaling workloads.

See also  Eaton eyes up solid-state transformers for data centres

An important impediment, as Hicks and Wright level out, is effectively increasing the underlying inference structure from single-server cases to a steady, distributed, and clear platform. For company AI to advance and for inference time scaling to be broadly used, this barrier should be addressed by means of group initiatives.

Source link

TAGGED: execs, Hat, Inference, opensource, Red, Scaling
Share This Article
Twitter Email Copy Link Print
Previous Article Motif Raises $46M in Seed and Series A Funding Blockskye Raises $15.8M in Funding
Next Article Receive Raises $4M in Seed Funding Receive Raises $4M in Seed Funding
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Arista laments ‘horrendous’ memory situation

Digging in on campus Arista has been clear about its plans to develop its presence…

February 15, 2026

Why Enterprises are Reopening their Wallets

After almost two years of financial uncertainty, enterprises have reopened their wallets, concentrating on safe entry…

July 13, 2025

World’s first real-time wearable human emotion recognition technology developed

Schematic illustration of the system overview with personalised skin-integrated facial interfaces (PSiFI). Credit score: UNIST…

February 24, 2024

Compact cooling pump drops temperatures by 16°F

A brand new UCLA-developed cooling gadget product of six skinny layers of nanotube-coated polymer movies…

January 19, 2025

The inference trap: How cloud providers are eating your AI margins

This text is a part of VentureBeat’s particular subject, “The Actual Price of AI: Efficiency,…

July 5, 2025

You Might Also Like

AI (Artificial Intelligence) technology, chip IC on PCB, PCB circuit board, microprocessor
Global Market

AMD strikes massive AI chip deal with Meta

By saad
Why data centre transformation must start with the customer
Global Market

Why data centre transformation must start with the customer

By saad
Experienced engineer writing data center disaster recovery plan on laptop to provide rapid restoration of service, limiting disruption and minimizing interruptions to normal operations
Global Market

From packets to prompts: What Cisco’s AITECH certification means for IT pros

By saad
Rethinking cooling strategy for the UK data centre market
Global Market

Rethinking cooling strategy for the UK data centre market

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.