Saturday, 15 Nov 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Innovations > A Minecraft-based benchmark to train and test multi-modal multi-agent systems
Innovations

A Minecraft-based benchmark to train and test multi-modal multi-agent systems

Last updated: January 12, 2025 12:51 am
Published January 12, 2025
Share
A Minecraft-based benchmark to train and test multi-modal multi-agent systems
SHARE
Greater than 30 goal objects or assets are utilized in TeamCraft duties. Credit score: UCLA.

Researchers on the College of California- Los Angeles (UCLA) have not too long ago developed TeamCraft, a brand new open-world atmosphere for the coaching and analysis of algorithms for embodied synthetic intelligence (AI) brokers, together with groups of a number of robots. This benchmark, launched in a paper revealed on the arXiv preprint server, is predicated on the favored videogame Minecraft.

“There’s a lack of multi-modal, multi-agent benchmarks for open-world environments,” Qian Lengthy, Ph.D. a scholar at UCLA, informed Tech Xplore.

“Minecraft, probably the most well-liked video games, gives a multidimensional, visually immersive realm characterised by procedurally generated landscapes and versatile recreation mechanics. Its dynamic nature helps a variety of actions, which made it a perfect platform for creating our visually wealthy multi-agent benchmark: TeamCraft.”

TeamCraft, the platform created by Lengthy and his colleagues, can be utilized to coach algorithms on 4 various kinds of duties, particularly constructing, clearing, farming and smelting. As a part of their research, the researchers additionally used their platform to judge current vision-language fashions (VLMs), which allowed them to raised perceive their limitations.

“TeamCraft is a multi-modal, multi-agent benchmark that addresses a major problem for AI,” mentioned Zhi Li, Ph.D. Pupil at UCLA. “Particularly, it helps to deal with the query: How effectively can embodied brokers collaborate in advanced environments with human-like notion?”

A Minecraft-based benchmark to train and test multi-modal multi-agent systems
Brokers collaborate to prepare dinner mutton in a desert village. Credit score: UCLA.

Within the TeamCraft benchmarking platform, each agent is supplied with first-person RGB information and standing data, which mirrors what a human agent would understand within the atmosphere. AI brokers might be educated and examined on varied duties that require them to collaborate with one another, perceive the atmosphere through first-person imaginative and prescient and make the most of accessible instruments.

To finish every job, the brokers have to carry out particular actions, comparable to people who a human participant would carry out in Minecraft. These actions are pre-defined (i.e., might be picked from a restricted set of choices) and self-descriptive (i.e., clearly named/labeled).

See also  DeepMind’s Michelangelo benchmark reveals limitations of long-context LLMs

“The primary benefit of TeamCraft is that it allows multi-modal job specification,” defined Li. “In contrast to prior techniques equivalent to ALFRED and MineDojo, which rely solely on textual content directions, TeamCraft helps multi-modal prompts. This expands the scope for richer and extra various job specs.”

One other distinctive attribute of TeamCraft is that it equips brokers with first-person RGB imaginative and prescient whereas they navigate the visually wealthy Minecraft atmosphere. That is in distinction with earlier approaches equivalent to Watch&Assist and RoCoBench, which relied on state-based observations, Neural MMO 2.0, which supplies simplified pixel-based visuals, and Overcooked-AI, which solely permits brokers to view 2D worlds.

“Whereas most prior works like MineDojo and VIMA-Bench concentrate on single-agent setups, TeamCraft prioritizes multi-agent environments to raised simulate real-world challenges requiring collaboration,” mentioned Li.

“It helps each centralized and decentralized management methods, enhancing flexibility in agent coordination and difficult capabilities of mannequin understanding.”

The duties included in TeamCraft are designed to evaluate the brokers’ planning, coordination and execution whereas they navigate a dynamic setting.

In distinction with another benchmarks, like FurnMove, the system doesn’t solely help the analysis of brokers which can be equally succesful throughout duties, but in addition of brokers with totally different duties.

In different phrases, it permits customers to distribute totally different roles to totally different brokers in a staff, by offering them with distinct capabilities. It may also be used to coach and take a look at the brokers’ decision-making abilities in real-time and their adaptability to altering environments.

TeamCraft includes a complete of 55,000 job variants. These variants are outlined primarily based on varied elements, together with Biomes (i.e., distinct areas inside the open-world atmosphere), base blocks, job objectives, goal supplies, brokers counts and distinctive inventories.

“Working within the Minecraft atmosphere, TeamCraft allows brokers to understand, assume, and act like human gamers with out good data,” mentioned Li.

See also  New machine learning technique can detect microplastics

“In contrast to prior techniques that present brokers with full information (e.g., unseen teammate places), TeamCraft requires brokers to actively discover their environment. This shift fosters extra sensible behaviors and reduces dependence on artificially good information, enabling brokers to raised deal with real-world eventualities and cut back the hole of deploying fashions to actual world utility.”

The benchmark created by the researchers additionally consists of ‘plug-and-play’ interfaces. Which means that it may be used each to check current fashions or prepare new ones, all inside a single standardized atmosphere. It might additionally function a gym-like playground to coach reinforcement studying (RL) algorithms that help multi-agent collaboration.

“TeamCraft demonstrates the potential for vision-based multi-agent collaboration within the open-world online game Minecraft,” mentioned Ran Gong, former Ph.D. scholar at UCLA.

“Minecraft’s wealthy and procedurally generated world supplies a difficult but versatile platform to discover collaborative problem-solving, useful resource administration, and job execution amongst a number of AI brokers. By specializing in vision-based inputs, TeamCraft emphasizes how brokers can interpret advanced visible cues to make selections, coordinate actions, and obtain shared objectives, all with out counting on predefined guidelines.”

By working exams on TeamCraft, the researchers demonstrated the existence of knowledge scaling legal guidelines, that are a key facet of AI mannequin efficiency. These legal guidelines present that there’s a constant sample within the coaching of AI fashions, the place an agent’s capability to carry out advanced duties and coordinate with different brokers improves because the coaching information it has entry to will increase.

“This discovering means that probably the most promising avenues for creating a simpler and strong system is to scale up the quantity of high-quality coaching information,” mentioned Gong. “By leveraging bigger datasets, fashions can study richer patterns, adapt higher to various eventualities, and improve their collaborative capabilities.”

See also  Over a third of UK businesses dangerously unprepared for AI risks

Sooner or later, TeamCraft may very well be utilized by pc scientists worldwide to coach and consider their machine learning-based fashions. As well as, it may assist the design of recent AI-based general-purpose videogame characters, which may collaborate higher with different characters or help human gamers as they’re enjoying a recreation.

“By way of pure interactions, these AI brokers may also help human gamers strategize, resolve challenges, and revel in a extra participating gaming expertise,” mentioned Gong. “Such developments may redefine the position of AI in gaming, remodeling it into an clever teammate or assistant able to adapting to human conduct and preferences.”

The code underpinning the TeamCraft benchmark is open-source and might be downloaded on GitHub. The brand new benchmark may quickly encourage the event of different open-world environments to coach or take a look at AI brokers, which additionally help multi-modal multi-agent interactions.

“Presently, the brokers in TeamCraft depend on implicit communication to coordinate their actions,” added Xiaofeng Gao, former Ph.D. scholar at UCLA.

“Enabling the brokers to speak explicitly through pure language can be an attention-grabbing path to discover. Furthermore, we plan to make TeamCraft a testbed for human-AI collaboration by together with human gamers within the video games.”

Extra data:
Qian Lengthy et al, TeamCraft: A Benchmark for Multi-Modal Multi-Agent Programs in Minecraft, arXiv (2024). DOI: 10.48550/arxiv.2412.05255

Journal data:
arXiv


© 2025 Science X Community

Quotation:
A Minecraft-based benchmark to coach and take a look at multi-modal multi-agent techniques (2025, January 10)
retrieved 11 January 2025
from https://techxplore.com/information/2025-01-minecraft-based-benchmark-multi-modal.html

This doc is topic to copyright. Other than any honest dealing for the aim of personal research or analysis, no
half could also be reproduced with out the written permission. The content material is offered for data functions solely.



Source link

TAGGED: benchmark, Minecraftbased, multiagent, multimodal, Systems, test, train
Share This Article
Twitter Email Copy Link Print
Previous Article Timberlyne Therapeutics Raises $180M in Series A Funding Timberlyne Therapeutics Raises $180M in Series A Funding
Next Article Origis Energy Origis Energy Closes $415M Funding Package
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Divisive new data center rules clear Fairfax County Planning Commission

Arguments for and towards the proposal Final week’s public listening to was attended by about…

June 10, 2024

Payroc Completes Acquisition of Agilisa Technologies

Payroc, a Tinley Park, IL-based service provider acquirer processor and fee integrations firm, accomplished the…

August 11, 2025

Why the entertainment industry needs an AI roadmap

Tim Levy, Founder and CEO of Twyn, points an pressing warning to the leisure sector:…

April 29, 2024

Perle Raises $9M in Seed Funding

Perle, a San Francisco, CA-based supplier of an AI coaching information platform empowered by Web3,…

August 9, 2025

Verizon Business brings multi-cloud management solution to network-as-a-service offering

Verizon Business today announced Network as a Service (NaaS) Cloud Management, a new service that…

February 9, 2024

You Might Also Like

AI-powered LED system delivers stable wireless power for indoor IoT devices
Innovations

AI-powered LED system delivers stable wireless power for indoor IoT devices

By saad
6g connectivity
Innovations

Europe’s push for 6G connectivity that serves society

By saad
Novel 3D nanofabrication techniques enable miniaturized robots
Innovations

Novel 3D nanofabrication techniques enable miniaturized robots

By saad
AI and the fight against money laundering: When algorithms chase shadows
Innovations

AI and the fight against money laundering

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.