Friday, 10 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’
AI

Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’

Last updated: January 12, 2025 1:59 am
Published January 12, 2025
Share
Researchers improved AI agent performance on unfamiliar tasks using 'Dungeons and Dragons'
SHARE

Be part of our every day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Study Extra


Organizations serious about deploying AI brokers should first fine-tune them, particularly in workflows that always really feel rote. Whereas some organizations need brokers that solely carry out one sort of process in a single workflow, generally brokers should be introduced into new environments with the hope that they adapt. 

Researchers from the Beijing University of Posts and Telecommunications have unveiled a brand new technique, AgentRefine. It teaches brokers to self-correct, resulting in extra generalized and adaptive AI brokers. 

The researchers stated that present tuning strategies restrict brokers to the identical duties as their coaching dataset, or “held-in” duties, and don’t carry out as effectively for “held-out,” or new environments. By following solely the foundations laid out by means of the coaching knowledge, brokers skilled with these frameworks would have bother “studying” from their errors and can’t be made into basic brokers and introduced into to new workflows. 

To fight that limitation, AgentRefine goals to create extra generalized agent coaching datasets that allow the mannequin to study from errors and match into new workflows. In a new paper, the researchers stated that AgentRefine’s aim is “to develop generalized agent-tuning knowledge and set up the correlation between agent generalization and self-refinement.” If brokers self-correct, they won’t perpetuate any errors they discovered and convey these identical errors to different environments they’re deployed in. 

“We discover that agent-tuning on the self-refinement knowledge enhances the agent to discover extra viable actions whereas assembly dangerous conditions, thereby leading to higher generalization to new agent environments,” the researchers write. 

See also  RSAC 2025: Why the AI agent era means more demand for CISOS

AI agent coaching impressed by D&D

Taking their cue from the tabletop roleplaying recreation Dungeons & Dragons, the researchers created personas, scripts for the agent to comply with and challenges. And sure, there’s a Dungeon Grasp (DM). 

They divided knowledge building for AgentRefine into three areas: script technology, trajectory technology and verification. 

In script technology, the mannequin creates a script, or information, with info on the setting, duties and actions personas can take. (The researchers examined AgentRefine utilizing Llama-3-8B-Instruct, Llama-3-70B-Instruct, Mistral-7B-Instruct-v0.3, GPT-4o-mini and GPT-4o)

The mannequin then generates agent knowledge that has errors and acts each as a DM and a participant through the trajectory stage. It asses the actions it might take after which see if these comprise errors. The final stage, verification, checks the script and trajectory, permitting for the potential of brokers it trains to do self-correction.

Higher and extra numerous process talents

The researchers discovered that brokers skilled utilizing the AgentRefine technique and dataset carried out higher on numerous duties and tailored to new eventualities. These brokers self-correct extra to redirect their actions and decision-making to keep away from errors, and grow to be extra sturdy within the course of. 

Particularly, AgentRefine improved the efficiency of all of the fashions to work on held-out duties. 

Enterprises should make brokers extra task-adaptable in order that they don’t repeat solely what they’ve discovered to allow them to grow to be higher decision-makers. Orchestrating brokers not solely “direct site visitors” for a number of brokers but in addition decide whether or not brokers have accomplished duties primarily based on consumer requests. 

See also  Meta researchers distill System 2 thinking into LLMs, improving performance on complex reasoning

OpenAI’s o3 provides “program synthesis” which may enhance process adaptability. Different orchestration and coaching frameworks, like Magentic-One from Microsoft, units actions for supervisor brokers to study when to maneuver duties to totally different brokers. 


Source link
TAGGED: Agent, Dragons, Dungeons, improved, performance, researchers, tasks, unfamiliar
Share This Article
Twitter Email Copy Link Print
Previous Article Origis Energy Origis Energy Closes $415M Funding Package
Next Article Scott Fanning (Aryaka) Scott Fanning (Aryaka) – HostingJournalist.com
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Hypernatural Raises $9.2M in Funding

Hypernatural, a NYC-based AI video creator, raised $9.2M in funding, throughout 2 rounds. The rounds…

July 23, 2025

10th District candidates back federal laws for data centers | News

State AlabamaAlaskaArizonaArkansasCaliforniaColoradoConnecticutDelawareFloridaGeorgiaHawaiiIdahoIllinoisIndianaIowaKansasKentuckyLouisianaMaineMarylandMassachusettsMichiganMinnesotaMississippiMissouriMontanaNebraskaNevadaNew HampshireNew JerseyNew MexicoNew YorkNorth CarolinaNorth DakotaOhioOklahomaOregonPennsylvaniaRhode IslandSouth CarolinaSouth DakotaTennesseeTexasUtahVermontVirginiaWashingtonWashington D.C.West VirginiaWisconsinWyomingPuerto RicoUS Virgin…

June 15, 2024

StorPool Announces Disaster Recovery Engine for KVM-based Clouds

Information storage software program firm StorPool Storage has unveiled a Catastrophe Restoration Engine for KVM-based…

February 13, 2025

Data Orchestration: Performance Is Key to a Global Data Environment | DCN

Successfully managing high-performance workloads calls for an equally high-performance infrastructure. Sadly, the everyday information administration…

March 14, 2024

Netskope expands ZTNA with device intelligence for IoT/OT environments

Netskope this week introduced it had up to date its common zero-trust community entry (ZTNA)…

October 12, 2025

You Might Also Like

Agentic AI's governance challenges under the EU AI Act in 2026
AI

Agentic AI’s governance challenges under the EU AI Act in 2026

By saad
Anthropic keeps new AI model private after it finds thousands of external vulnerabilities
AI

Anthropic keeps new AI model private after it finds thousands of external vulnerabilities

By saad
Microsoft open-source toolkit secures AI agents at runtime
AI

Microsoft open-source toolkit secures AI agents at runtime

By saad
AI workflows for software developers and the need for oversight
AI

AI workflows for software developers and the need for oversight

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.