Saturday, 11 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Edge Computing > EdgeCortix launches SAKURA-II platform to power the next wave of generative AI at the edge
Edge Computing

EdgeCortix launches SAKURA-II platform to power the next wave of generative AI at the edge

Last updated: May 23, 2024 10:40 am
Published May 23, 2024
Share
EdgeCortix launches SAKURA-II platform to power the next wave of generative AI at the edge
SHARE

EdgeCortix Inc., a fabless semiconductor firm specialising in energy-efficient AI processing on the edge, right now unveiled its next-generation SAKURA-II Edge AI accelerator.

This platform, paired with EdgeCortix’s modern second era Dynamic Neural Accelerator (DNA) structure, is engineered to deal with essentially the most difficult Generative AI duties within the business. Designed for flexibility and energy effectivity, SAKURA-II empowers customers to seamlessly handle a variety of advanced duties together with Massive Language Fashions (LLMs), Massive Imaginative and prescient Fashions (LVMs), and multi-modal transformer-based purposes, even inside the stringent environmental constraints on the edge.

That includes low latency, ‘best-in-class’ reminiscence bandwidth, excessive accuracy, and compact kind elements, SAKURA-II delivers unparalleled efficiency and cost-efficiency throughout the various spectrum of edge AI purposes.

Nicely-suited for quite a few use circumstances throughout the manufacturing, business 4.0, safety, robotics, aerospace, and telecommunications industries, SAKURA-II options EdgeCortix’s newest era runtime reconfigurable neural processing engine, DNA-II. Leveraging this extremely configurable mental property block, SAKURA-II delivers energy effectivity and real-time processing capabilities whereas concurrently executing a number of deep neural community fashions with low latency. SAKURA-II can ship as much as 60 trillion operations per second (TOPS) of efficient 8-bit integer efficiency and 30 trillion 16-bit mind floating-point operations per second (TFLOPS), whereas additionally supporting built-in blended precision for dealing with the rigorous calls for of next-generation AI duties.

The SAKURA-II platform, with its subtle MERA software program suite, incorporates a heterogeneous compiler platform, superior quantisation, and mannequin calibration capabilities. This software program suite contains native assist for main growth frameworks comparable to PyTorch, TensorFlow Lite, and ONNX. MERA’s versatile host-to-accelerator unified runtime is adept at scaling throughout single, multi-chip, and multi-card programs on the edge, considerably streamlining AI inferencing and shortening deployment occasions for information scientists. Moreover, the mixing with the MERA Mannequin Library, with seamless interface to Hugging Face Optimum, provides customers entry to an intensive vary of the most recent transformer fashions, making certain a easy transition from coaching to edge inference.

See also  Cisco, NVIDIA and Sharon AI bring hyperscale-class AI infrastructure onshore in Australia

Sakyasingha Dasgupta, CEO and founding father of EdgeCortix, mentioned: “SAKURA-II’s spectacular 60 TOPS efficiency inside 8W of typical energy consumption, mixed with its mixed-precision and built-in reminiscence compression capabilities, positions it as a pivotal expertise for the most recent Generative AI options on the edge.

“Whether or not working conventional AI fashions or the most recent Llama 2/3, Secure-diffusion, Whisper or Imaginative and prescient-transformer fashions, SAKURA-II supplies deployment flexibility at superior efficiency per watt and cost-efficiency. We’re dedicated to making sure we meet our buyer’s various wants and in addition to securing a technological basis that is still sturdy and adaptable inside the swiftly evolving AI sector.”

Key Advantages of SAKURA-II embrace:

  • Optimised for Generative AI: Tailor-made particularly for processing Generative AI workloads on the edge with minimal energy consumption.
  • Advanced Mannequin Dealing with: Able to managing multi-billion parameter fashions like Llama 2, Secure Diffusion, DETR, and ViT inside a typical energy envelope of 8W.
  • Seamless Software program Integration: Totally suitable with EdgeCortix’s MERA software program suite, facilitating seamless transitions from mannequin coaching to deployment.
  • Enhanced Reminiscence Bandwidth: Affords as much as 4 occasions extra DRAM bandwidth than competing AI accelerators, making certain superior efficiency for LLM and LVM.
  • Actual-Time Information Streaming: Optimised for low-latency operations below real-time information streaming circumstances.
  • Superior Precision: Supplies software-enabled mixed-precision assist for close to FP32 accuracy.
  • Sparse Computation: Helps sparse computation to scale back reminiscence footprint and optimise bandwidth.
  • Versatile Performance: Helps arbitrary activation capabilities with {hardware} approximation for enhanced adaptability.
  • Environment friendly Information Dealing with: Features a devoted Reshaper engine to handle advanced information permutations on-chip and minimise host CPU load.
  • Energy Administration: Options on-chip power-gating and energy administration capabilities to facilitate ultra-high effectivity modes.
See also  Latent AI debuts agentic platform to automate edge AI at scale

SAKURA-II can be supplied as a stand-alone system, two totally different M.2 modules with various DRAM capability, single and dual-device low-profile PCIe playing cards. Clients can reserve M.2 modules and PCIe playing cards right now for supply within the second half of 2024.

Need to be taught extra about edge computing from business leaders? Take a look at Edge Computing Expo going down in Amsterdam, California and London. 

Discover different upcoming enterprise expertise occasions and webinars powered by TechForge here.

Source link

TAGGED: edge, EdgeCortix, generative, launches, Platform, Power, SAKURAII, Wave
Share This Article
Twitter Email Copy Link Print
Previous Article Lenovo unveils Truscale Hybrid Cloud for edge to empower data-driven workloads Zenlayer launches port-based purchasing model for network management needs
Next Article Germany Data Center Market Trends and Analysis 2023-2028: Berlin, Hamburg, and Frankfurt Emerge as Hotspots for Investment with Alibaba Cloud's AI and Machine Learning Expansions US Data Center Construction Industry Report 2024-2029 – $47+ Billion Market is Booming Due to Demand from Hyperscale Companies, Artificial Intelligence Needs, and Growth of Edge Computing
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Steps to sustainable networking – Data Centre Review

Mattias Fridström, Chief Evangelist at Arelion, highlights how the sector could make significant progress in…

February 15, 2024

Kubernetes is (not) a cost optimization problem

Kubernetes has change into the de facto solution to schedule and handle providers in medium…

March 8, 2024

Critical Insight 2024: Sarah Peterson explores the regulatory future of data centres

To supply the perfect experiences, we use applied sciences like cookies to retailer and/or entry…

October 25, 2024

Cohere’s smallest, fastest R-series model excels at RAG, reasoning in 23 languages

Be part of our day by day and weekly newsletters for the most recent updates…

December 14, 2024

Digital Realty acquires Slough data centre campus

The newly acquired campus options two particular person information facilities with a mixed capability of…

July 12, 2024

You Might Also Like

Heat emission from the chimneys of a large data and server complex.
Global Market

OpenAI puts part of Stargate project on hold over runaway power costs

By saad
DDN and Zadara target sovereign AI deployments with multi-tenant NVIDIA factory stack
Edge Computing

DDN and Zadara target sovereign AI deployments with multi-tenant NVIDIA factory stack

By saad
Premio targets multi-camera edge AI with new Jetson Orin systems
Edge Computing

Premio targets multi-camera edge AI with new Jetson Orin systems

By saad
Server racks with illuminated indicators in a dimly lit data center.
Global Market

Aria Networks raises $125M, launches platform for AI factories

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.