AI21 CEO says transformers not right for AI agents due to error perpetuation

Last updated: October 12, 2024 3:26 am
Published October 12, 2024

As more enterprise organizations look to the so-called agentic future, one barrier may be how AI models are built. For enterprise AI developer AI21, the answer is clear: the industry needs to look to other model architectures to enable more efficient AI agents.

Ori Goshen, AI21 CEO, said in an interview with VentureBeat that Transformers, the most popular model architecture, have limitations that could make a multi-agent ecosystem difficult.

“One trend I’m seeing is the rise of architectures that aren’t Transformers, and these alternative architectures can be more efficient,” Goshen said. “Transformers function by creating so many tokens that can get very expensive.”

AI21, which focuses on developing enterprise AI solutions, has made the case before that Transformers should be an option for model architecture but not the default. It is developing foundation models using its Jamba architecture, short for Joint Attention and Mamba architecture. Jamba is based on the Mamba architecture developed by researchers from Princeton University and Carnegie Mellon University, which can offer faster inference times and longer context.

Goshen said alternative architectures, like Mamba and Jamba, can often make agentic structures more efficient and, most importantly, affordable. For him, Mamba-based models have better memory performance, which would make agents, particularly agents that connect to other models, work better.

He attributes the fact that AI agents are only now gaining popularity, and that most agents have not yet gone into production, to the reliance on LLMs built with transformers.

“The main reason agents aren’t in production mode yet is reliability or the lack of reliability,” Goshen said. “When you break down a transformer model, you know it’s very stochastic, so any errors will perpetuate.”
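
To see why small per-step error rates matter for agents, consider a minimal sketch. It assumes every step in an agent workflow succeeds independently with the same probability and that an early mistake is never corrected later. That is a simplification for illustration, not a model Goshen described, and the function name and numbers are hypothetical.

```python
def chain_success_probability(per_step_success: float, steps: int) -> float:
    """Probability that every step in a chain of agent actions succeeds.

    Assumes steps fail independently and that an early error is never
    corrected downstream (an illustrative simplification).
    """
    if not 0.0 <= per_step_success <= 1.0:
        raise ValueError("per_step_success must be between 0 and 1")
    if steps < 0:
        raise ValueError("steps must be non-negative")
    return per_step_success ** steps


if __name__ == "__main__":
    # Hypothetical reliability figure, chosen only to show the trend.
    for steps in (1, 5, 10, 20):
        p = chain_success_probability(0.95, steps)
        print(f"{steps:>2} steps at 95% each: {p:.2f} chance of a clean run")
```

Under these assumptions, a 10-step workflow built from steps that are each 95% reliable completes without error only about 60% of the time, which is one way to read the claim that errors perpetuate.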

Enterprise agents are growing in popularity

AI agents emerged as one of the biggest trends in enterprise AI this year. Several companies launched AI agents and platforms to make it easy to build agents.

ServiceNow announced updates to its Now Assist AI platform, including a library of AI agents for customers. Salesforce has its stable of agents called Agentforce, while Slack has begun allowing users to integrate agents from Salesforce, Cohere, Workday, Asana, Adobe and more.

Goshen believes that this trend will become even more popular with the right mix of models and model architectures.

“Some use cases that we see now, like question and answers from a chatbot, are basically glorified search,” he said. “I think real intelligence is in connecting and retrieving different information from sources.”

Goshen added that AI21 is in the process of developing offerings around AI agents.

Other architectures vying for attention

Goshen strongly supports alternative architectures like Mamba and AI21’s Jamba, mainly because he believes transformer models are too expensive and unwieldy to run.

Instead of the attention mechanism that forms the backbone of transformer models, Mamba can prioritize different data and assign weights to inputs, optimize memory usage, and make use of a GPU’s processing power.
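
The memory argument can be illustrated with a rough back-of-the-envelope sketch. A transformer keeps a key/value cache that grows with every token in context, while a Mamba-style state space layer carries a fixed-size recurrent state. The layer counts and dimensions below are hypothetical, chosen only for illustration; they do not describe Jamba or any released model, and the estimate ignores weights, activations and Mamba's small convolution buffer.

```python
def kv_cache_bytes(layers: int, kv_heads: int, head_dim: int,
                   context_tokens: int, bytes_per_value: int = 2) -> int:
    """Approximate inference-time key/value cache size for attention layers.

    The leading factor of 2 counts one key and one value vector per token.
    """
    return 2 * layers * kv_heads * head_dim * context_tokens * bytes_per_value


def ssm_state_bytes(layers: int, inner_dim: int, state_dim: int,
                    bytes_per_value: int = 2) -> int:
    """Approximate recurrent state size for Mamba-style layers.

    Note that context length does not appear: the state has a fixed size.
    """
    return layers * inner_dim * state_dim * bytes_per_value


if __name__ == "__main__":
    # Hypothetical model shapes, assumed for illustration only.
    fixed_state = ssm_state_bytes(layers=32, inner_dim=4096, state_dim=16)
    for context in (4_096, 32_768, 262_144):
        cache = kv_cache_bytes(layers=32, kv_heads=8, head_dim=128,
                               context_tokens=context)
        print(f"context {context:>7}: attention cache {cache / 2**20:8.0f} MiB, "
              f"state space state {fixed_state / 2**20:.0f} MiB")
```

The attention cache scales linearly with context length in this estimate, while the state space figure stays constant, which is the kind of trade-off behind the memory and long-context claims made for Mamba-based models.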

Mamba is growing in popularity. Other open-source and open-weight AI developers have begun releasing Mamba-based models in the past few months. Mistral released Codestral Mamba 7B in July, and in August, Falcon came out with its own Mamba-based model, Falcon Mamba 7B.

However, the transformer architecture has become the default, if not standard, choice when developing foundation models. OpenAI’s GPT is, of course, a transformer model (it’s literally in its name), but so are most other popular models.

Goshen said that, ultimately, enterprises want whichever approach is more reliable. But organizations must also be wary of flashy demos promising to solve many of their problems.

“We’re at the phase where charismatic demos are easy to do, but we’re closer to that than to the product phase,” Goshen said. “It’s okay to use enterprise AI for research, but it’s not yet at the point where enterprises can use it to inform decisions.”

