
There’s more to cloud architecture than GPUs

Last updated: April 4, 2024 9:00 am
Published April 4, 2024

Talk to anyone about generative AI in the cloud, and the conversation quickly turns to GPUs (graphics processing units). But that could be a false objective. GPUs don’t matter as much as people think they do, and in a few years, the conversation will likely shift to what is far more critical to the development and deployment of generative AI systems in the cloud.

The current assumption is that GPUs are indispensable for facilitating the complex computations required by generative AI models. While GPUs have been pivotal in advancing AI, overemphasizing them might detract from exploring and leveraging equally effective and potentially more sustainable alternatives. Indeed, GPUs could quickly become commodities like the other resources that AI systems need, such as storage and processing space. The focus should be on designing and deploying these systems, not just the hardware they run on. Call me crazy.

GPU gold rush

The importance of GPUs has worked out well for Nvidia, a company most people didn’t pay much attention to until now. In its most recent quarter, Nvidia posted record-high data center revenue of $14.5 billion, up 41% from the prior quarter and 279% from the year-ago quarter. Its GPUs are now the standard in AI processing, even more so than in gaming.
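To put those growth rates in perspective, the earlier quarters’ revenue can be backed out from the percentages. This is a rough sketch only: the function name is made up for illustration, and because the published percentages are rounded, the results are approximations rather than reported figures.

```python
def implied_base(current, growth_pct):
    """Return the earlier value implied by a current value and a percent increase."""
    return current / (1 + growth_pct / 100)

current = 14.5  # data center revenue in billions of dollars, from the article

# Approximate revenue implied for the comparison quarters (not reported figures).
prior_quarter = implied_base(current, 41)
year_ago = implied_base(current, 279)

print(f"Implied prior quarter: ${prior_quarter:.2f}B")
print(f"Implied year-ago quarter: ${year_ago:.2f}B")
```

In other words, the stated growth implies roughly $10.3 billion in the prior quarter and roughly $3.8 billion a year earlier.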

Beyond the explosion of Nvidia’s stock, you can’t open social media without seeing somebody taking a selfie with Jensen Huang, Nvidia’s CEO. Moreover, everyone who’s anyone has partnered with Nvidia, running multimillion-dollar budgets to get close to this high-growth company and its technology.

Originally designed in the 1990s to accelerate 3D graphics in gaming, GPUs have evolved well beyond their origins. Early GPU architecture was highly specialized for graphical calculations and was used primarily for rendering images and handling the intensive parallel processing tasks associated with 3D rendering. That heritage makes GPUs a good fit for AI, since they are adept at tasks requiring many simultaneous computations.


Are GPUs really a big deal?

GPUs require a host chip to orchestrate operations. Although that description simplifies the complexity and capability of modern GPU architectures, the arrangement is also less efficient than it could be. GPUs operate in conjunction with CPUs (the host chip), which offload specific tasks to GPUs. These host chips also manage the overall operation of software programs.

Adding to this question of efficiency is the need for inter-process communications; the challenges of disassembling models, processing them in parts, and then reassembling the outputs for comprehensive analysis or inference; and the complexities inherent in using GPUs for deep learning and AI. This segmentation and reintegration process is part of distributing computing tasks to optimize performance, but it comes with its own efficiency questions.
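The disassemble, process, and reassemble pattern can be sketched in a few lines. This is a minimal illustration under stated assumptions, not how any particular GPU framework does it: the function names are hypothetical, a thread pool stands in for the accelerators, and a small matrix-vector product stands in for a model layer.

```python
from concurrent.futures import ThreadPoolExecutor

def matvec(rows, x):
    """Multiply a list of weight rows by the input vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in rows]

def split_rows(weights, parts):
    """Disassemble: cut the weight matrix into contiguous row shards."""
    size = -(-len(weights) // parts)  # ceiling division
    return [weights[i:i + size] for i in range(0, len(weights), size)]

def sharded_matvec(weights, x, parts=2):
    """Host-side orchestration: scatter shards, compute each, gather in order."""
    shards = split_rows(weights, parts)
    # The worker threads are a stand-in for devices (an assumption for this sketch).
    with ThreadPoolExecutor(max_workers=parts) as pool:
        partials = list(pool.map(lambda shard: matvec(shard, x), shards))
    # Reassemble: concatenate the partial outputs into one result vector.
    return [y for part in partials for y in part]

weights = [[1, 0, 2], [0, 3, 1], [4, 1, 0], [2, 2, 2]]
x = [1, 2, 3]
result = sharded_matvec(weights, x, parts=2)
print(result)
```

Even in this toy version, the host does work that is not computation: slicing the weights, dispatching shards, waiting on workers, and stitching results back together. On real hardware, those steps also involve moving data between host and device memory, which is where the efficiency questions come from.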

Software libraries and frameworks designed to abstract and manage these operations are required. Technologies like Nvidia’s CUDA (Compute Unified Device Architecture) provide the programming model and toolkit needed to develop software that can harness GPU acceleration capabilities.

A core reason for the high interest in Nvidia is that it provides a software ecosystem that allows GPUs to work more efficiently with applications, including gaming, deep learning, and generative AI. Without these ecosystems, CUDA and others wouldn’t have the same potential. Thus, the spotlight is on Nvidia, which has both the processor and the ecosystem, for now.

Alternatives on the horizon

I’m not saying that Nvidia GPUs are bad technology. Clearly they’re effective. The argument is that making the processing layer the major focus of building and deploying generative AI systems in the cloud is a bit of a distraction.


I suspect that in two years, GPUs will certainly still be in the picture, but the excitement about them will have long passed. Instead, we’ll be focused on inference efficiency, continuous model improvement, and new ways to manage algorithms and data.

The meteoric rise of Nvidia has investors reaching for their checkbooks to fund any potential alternative that could play in that market. The apparent competitors right now are AMD and Intel. Intel, for example, is pursuing a GPU alternative with its Gaudi 3 processor. More interestingly, several startups purport to have created better ways to process large language models. A short list of these companies includes SambaNova, Cerebras, GraphCore, Groq, and xAI.

Of course, not only are these companies looking to build chips and software ecosystems for those chips, many are also working to provide microclouds, or small cloud providers, that will offer their GPU alternatives as a service, much like AWS, Microsoft, and Google do today with available GPUs. The list of GPU cloud providers is growing by the day, judging from the number of PR agencies banging on my door for attention.

While they are just reselling Nvidia GPU processing today, you can count on these same microclouds to adopt new GPU analogs as they hit the market, provided that they are cheaper, more efficient, and require less power. If that occurs, they will quickly replace whatever processor is less advanced. What’s more, if the performance and reliability are there, we really don’t care what brand the processor is, or even what architecture it employs. In that world, I doubt we’ll be seeking selfies with the CEOs of those companies. The processor is just a component of a system that works.


Sometimes GPUs are not needed

Of course, as I’ve covered before, GPUs are not always needed for generative AI or other AI processing. Smaller models might run efficiently on traditional CPUs or other specialized hardware and be more cost- and energy-efficient.

Many of my generative AI architectures have used traditional CPUs without a significant impact on performance. Of course, it depends on what you’re attempting to do. Most enterprise generative AI deployments will require less power, and I suspect that many of the current generative AI projects that insist on using GPUs are overkill.

Eventually we’ll get better at understanding when GPUs (or their analogs) should be used and when they are not needed. However, much like we’re seeing with the cloud-flation out there, enterprises may overprovision the processing power for their AI systems and won’t care until they see the bill. We have not yet reached the point where we are too worried about the cost optimization of generative AI systems, but we will have to be accountable at some point.

Okay, Linthicum is being a buzzkill again. I guess I am, but for good reason. We’re about to enter a time of much change and transformation in the use of AI technology that will impact IT moving forward. What keeps me up at night is that the IT industry is being distracted by another shiny object. That typically doesn’t end well.

Copyright © 2024 IDG Communications, Inc.

