Saturday, 28 Feb 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Cloud Computing > How cloud providers are tackling GPU shortages with custom chips
Cloud Computing

How cloud providers are tackling GPU shortages with custom chips

Last updated: December 6, 2024 7:40 pm
Published December 6, 2024
Share
How cloud providers are tackling GPU shortages with custom chips
SHARE

GPUs are the spine of AI computing, however as demand exceeds provide, cloud suppliers are getting artistic.

As a substitute of ready for extra GPUs, as Network World reported, they’re creating customized chips to fulfill particular workloads, delivering quicker, extra environment friendly computing whereas maintaining prices below management.

The competitors is heating up. At Microsoft’s Ignite convention final week, the corporate unveiled two new chips designed to spice up efficiency for its Azure platform. All eyes are actually on AWS, because it gears up for its personal, customized silicon portfolio.

Why customized chips matter

GPUs have revolutionised duties like coaching AI fashions, however they’re not at all times one of the best instrument for the job. They arrive with vital drawbacks: excessive energy consumption, intensive cooling wants, and, proper now, a worldwide scarcity. Nvidia’s newest GPUs stock is spoken for, for the following 12 months.

Customized accelerators are stepping in to fill the hole. Mario Morales, vp analyst at IDC, highlights the rising significance of options to GPUs: “These accelerators have gotten more and more necessary in cloud infrastructure attributable to their superior price-performance and price-efficiency ratios, which result in higher return on investments.”

AWS and Google have been rolling out customized chips for years—AWS with Trainium and Inferentia, and Google with Tensor Processing Items (TPUs). Microsoft, nonetheless, was late to hitch the customized silicon pattern. It wasn’t till final 12 months that the corporate launched its first customized chips, Maia and Cobalt, geared toward bettering power effectivity and dealing with AI workloads.

See also  Reigniting the European digital economy's €200bn AI ambitions

This 12 months, Microsoft has stepped up its sport, introducing two new chips:

  • Azure Increase DPU: Designed to optimise information processing by operating a customized working system.
  • Azure Built-in HSM: Centered on safety, it retains encryption and signing keys securely in {hardware}.

Microsoft’s Azure Increase DPU is a step ahead, nevertheless it nonetheless lags behind opponents within the DPU area. Forrester senior analyst Alvin Nguyen notes that Google’s E2000 IPU, co-developed with Intel, and AWS’s Nitro system are each already well-established. Different cloud suppliers, together with Nvidia with its Bluefield chips and AMD with Pensando, are jockeying for place.

That stated, Microsoft is making notable developments in infrastructure. The corporate introduced new liquid-cooling options for AI servers and a power-efficient rack design co-developed with Meta, which may pack 35% extra AI accelerators into every rack.

Safety will get a customized enhance

Safety is one other space the place customized silicon is making progress. Microsoft’s new HSM chip is a devoted resolution for encryption duties that will historically require a mixture of {hardware} and software program. Nguyen notes this strategy reduces latency and enhances scalability, making it an addition value contemplating.

AWS and Google are additionally utilizing customized chips for safety. AWS Nitro prevents major system CPUs from modifying firmware, and Google’s Titan establishes ‘a safe root of belief’ for validating system well being.

Every supplier has its personal strategy, Nguyen explains. “Whereas Nitro gives the important safety operate of making certain that the primary system CPUs can not replace firmware in naked steel mode, Titan gives a hardware-based root of belief that establishes the robust id of a machine, with which we will make necessary safety selections and validate the well being of the system.”

See also  Kubernetes 1.35 enables zero-downtime resource scaling for production cloud workloads

The way forward for customized chips within the cloud

The push for customized silicon isn’t slowing. Based on Alexander Harrowell, principal analyst at Omdia, it’s a logical transfer for hyperscalers to spend money on these chips to cut back prices and enhance effectivity.

Because the demand for quicker, extra specialised computing grows, customized chips are a technique for cloud suppliers to remain aggressive. With innovation in overdrive, the race to redefine cloud efficiency is simply beginning.

(Picture by Unsplash)

See additionally: IBM needs Nvidia GPUs, and AWS may be the reply

Need to be taught extra about cybersecurity and the cloud from trade leaders? Try Cyber Security & Cloud Expo happening in Amsterdam, California, and London. Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

Tags: AI, cloud, microsoft

Source link

TAGGED: Chips, cloud, custom, GPU, Providers, shortages, Tackling
Share This Article
Twitter Email Copy Link Print
Previous Article Meta launches Llama 3.3, shrinking powerful 405B open model Meta launches Llama 3.3, shrinking powerful 405B open model
Next Article Pepeto and Pepe Unchained Compete for Dominance in the Next Memecoin Era Pepeto and Pepe Unchained Compete for Dominance in the Next Memecoin Era
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

AI’s challenge to Internet freedom

Julius Černiauskas, CEO at Oxylabs, explores how, while AI may have its threats, we can…

February 13, 2024

Data centre energy storage market is projected to reach $4.3 bn by 2034

Because the demand for digital companies surges, information facilities are underneath growing stress to undertake…

May 5, 2025

EU AI Act: How CIOs Can Prepare | DCN

The European Union has handed its sweeping AI Act, which is able to set up…

March 25, 2024

CoreWeave sets AI infrastructure benchmark with NVIDIA GB300 NVL72 rollout

CoreWeave grew to become the primary AI GPU cloud supplier to deploy NVIDIA GB300 NVL72…

July 10, 2025

Network Security to Reach $38B by 2029 on SaaS, Cloud Growth

The worldwide community safety market is on monitor to develop from $24 billion in 2024…

August 7, 2025

You Might Also Like

ASML's high-NA EUV tools clear the runway for next-gen AI chips
AI

ASML’s high-NA EUV tools clear the runway for next-gen AI chips

By saad
What is Famous Labs? Building an autonomous creation ecosystem
Cloud Computing

What is Famous Labs? Building an autonomous creation ecosystem

By saad
ControlMonkey extends configuration disaster recovery to cloud network vendors
Global Market

ControlMonkey extends configuration disaster recovery to cloud network vendors

By saad
Thomson Reuters, RBC embed AI into enterprise cloud workflows
Cloud Computing

Thomson Reuters, RBC embed AI into enterprise cloud workflows

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.