Tuesday, 14 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Edge Computing > Fastly’s AI accelerator tackles generative AI bottlenecks with 9x faster response times
Edge Computing

Fastly’s AI accelerator tackles generative AI bottlenecks with 9x faster response times

Last updated: December 19, 2024 2:02 am
Published December 19, 2024
Share
Fastly’s AI accelerator tackles generative AI bottlenecks with 9x faster response times
SHARE

International edge cloud platforms supplier Fastly has launched the Fastly AI Accelerator, a semantic caching answer geared toward enhancing efficiency and decreasing prices for builders utilizing Massive Language Mannequin (LLM) generative AI functions.

The AI Accelerator delivers a mean of 9x sooner response occasions in comparison with conventional strategies. Initially supporting OpenAI ChatGPT, it now additionally contains Microsoft Azure AI Foundry.

Builders can simply implement the AI Accelerator by updating their utility to a brand new API endpoint, typically requiring only a single line of code change.

The answer reduces the necessity for repeated API calls to AI suppliers, enhancing efficiency and person expertise.

“Fastly AI Accelerator is a big step in direction of addressing the efficiency bottleneck accompanying the generative AI increase,” says Dave McCarthy, Analysis Vice President, Cloud and Edge Providers at IDC. “This transfer solidifies Fastly’s place as a key participant within the fast-evolving edge cloud panorama. The distinctive strategy of utilizing semantic caching to cut back API calls and prices unlocks the true potential of LLM generative AI apps with out compromising on velocity or effectivity, permitting Fastly to reinforce the person expertise and empower builders.”

Current Fastly clients can entry the AI Accelerator immediately by means of their accounts.

Associated

AI  |  Fastly  |  generative AI  |  LLM  |  semantic caching

Source link

See also  Hailo unveils edge AI chips for generative AI and automotive innovations at CES 2025
TAGGED: accelerator, bottlenecks, faster, Fastlys, generative, response, tackles, times
Share This Article
Twitter Email Copy Link Print
Previous Article Plume Plume Raises $20M in Series A Funding
Next Article Beyond LLMs: How SandboxAQ's large quantitative models could optimize enterprise AI Beyond LLMs: How SandboxAQ’s large quantitative models could optimize enterprise AI
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Aramco Digital and Intel to establish Saudi Arabia’s first Open RAN development centre

Aramco Digital and Intel plan to establish Saudi Arabia’s inaugural Open RAN (Radio Access Network)…

January 22, 2024

Better Futures Raises €500K in Funding

Pictured at NovaUCD in Dublin is Anthony Mc Loughlin, CEO and Founder, Higher Futures. (Credit…

June 27, 2024

Blackstone Launches UK Venture, Exploring the Industry’s Power Problem

With information middle information shifting quicker than ever, we wish to make it simple for…

September 27, 2024

voize Raises $9M in Seed Funding

voize, a Berlin, Germany-based AI startup creating speech recognition to speed up digital documentation processes,…

April 1, 2025

Strengthening Our Core: Welcoming Karyne Levy as VentureBeat’s New Managing Editor

I’m thrilled to announce a implausible new addition to our management workforce: Karyne Levy is…

November 4, 2025

You Might Also Like

Lambda doubles down on NVIDIA stack with 10,000+ Blackwell GPUs and CPO networking push
Edge Computing

Lambda doubles down on NVIDIA stack with 10,000+ Blackwell GPUs and CPO networking push

By saad
DDN and Zadara target sovereign AI deployments with multi-tenant NVIDIA factory stack
Edge Computing

DDN and Zadara target sovereign AI deployments with multi-tenant NVIDIA factory stack

By saad
Premio targets multi-camera edge AI with new Jetson Orin systems
Edge Computing

Premio targets multi-camera edge AI with new Jetson Orin systems

By saad
Hosted.ai raises $19M to tackle GPU underutilization and reshape AI infrastructure economics
Edge Computing

Hosted.ai raises $19M to tackle GPU underutilization and reshape AI infrastructure economics

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.