Sunday, 14 Dec 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Meta unveils five AI models for multi-modal processing, music generation, and more
AI

Meta unveils five AI models for multi-modal processing, music generation, and more

Last updated: June 19, 2024 8:47 pm
Published June 19, 2024
Share
Meta unveils five AI models for multi-modal processing, music generation, and more
SHARE

Meta has unveiled 5 main new AI fashions and analysis, together with multi-modal techniques that may course of each textual content and pictures, next-gen language fashions, music era, AI speech detection, and efforts to enhance variety in AI techniques.

The releases come from Meta’s Basic AI Analysis (FAIR) group which has centered on advancing AI by way of open analysis and collaboration for over a decade. As AI quickly innovates, Meta believes working with the worldwide neighborhood is essential.

“By publicly sharing this analysis, we hope to encourage iterations and in the end assist advance AI in a accountable approach,” mentioned Meta.

Chameleon: Multi-modal textual content and picture processing

Among the many releases are key parts of Meta’s ‘Chameleon’ fashions beneath a analysis license. Chameleon is a household of multi-modal fashions that may perceive and generate each textual content and pictures concurrently—not like most massive language fashions that are sometimes unimodal.

“Simply as people can course of the phrases and pictures concurrently, Chameleon can course of and ship each picture and textual content on the identical time,” defined Meta. “Chameleon can take any mixture of textual content and pictures as enter and likewise output any mixture of textual content and pictures.”

Potential use circumstances are nearly limitless from producing inventive captions to prompting new scenes with textual content and pictures.

Multi-token prediction for sooner language mannequin coaching

Meta has additionally launched pretrained fashions for code completion that use ‘multi-token prediction’ beneath a non-commercial analysis license. Conventional language mannequin coaching is inefficient by predicting simply the following phrase. Multi-token fashions can predict a number of future phrases concurrently to coach sooner.

See also  LinkedIn founder Reid Hoffman unveils ‘super agency’ vision at TED AI conference, takes subtle shot at Elon Musk

“Whereas [the one-word] method is straightforward and scalable, it’s additionally inefficient. It requires a number of orders of magnitude extra textual content than what youngsters must be taught the identical diploma of language fluency,” mentioned Meta.

JASCO: Enhanced text-to-music mannequin

On the inventive aspect, Meta’s JASCO permits producing music clips from textual content whereas affording extra management by accepting inputs like chords and beats.

“Whereas present text-to-music fashions like MusicGen rely primarily on textual content inputs for music era, our new mannequin, JASCO, is able to accepting numerous inputs, akin to chords or beat, to enhance management over generated music outputs,” defined Meta.

AudioSeal: Detecting AI-generated speech

Meta claims AudioSeal is the primary audio watermarking system designed to detect AI-generated speech. It may well pinpoint the precise segments generated by AI inside bigger audio clips as much as 485x sooner than earlier strategies.

“AudioSeal is being launched beneath a business license. It’s simply certainly one of a number of strains of accountable analysis we have now shared to assist forestall the misuse of generative AI instruments,” mentioned Meta.

Bettering text-to-image variety

One other essential launch goals to enhance the variety of text-to-image fashions which might usually exhibit geographical and cultural biases.

Meta developed automated indicators to guage potential geographical disparities and performed a big 65,000+ annotation research to know how individuals globally understand geographic illustration.

“This allows extra variety and higher illustration in AI-generated pictures,” mentioned Meta. The related code and annotations have been launched to assist enhance variety throughout generative fashions.

By publicly sharing these groundbreaking fashions, Meta says it hopes to foster collaboration and drive innovation inside the AI neighborhood.

See also  DeepSeek V3.1 just dropped — and it might be the most powerful open AI yet

(Picture by Dima Solomin)

See additionally: NVIDIA presents newest developments in visible AI

Wish to be taught extra about AI and large information from trade leaders? Take a look at AI & Big Data Expo going down in Amsterdam, California, and London. The excellent occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

Tags: ai, synthetic intelligence, audioseal, chameleon, honest, jasco, meta, meta ai, fashions, music era, open supply, text-to-image

Source link

TAGGED: generation, Meta, models, multimodal, Music, processing, unveils
Share This Article
Twitter Email Copy Link Print
Previous Article Microsoft and OpenAI say hackers are using ChatGPT to improve cyberattacks Update your Windows PC to avoid a serious Wi-Fi vulnerability
Next Article Jan Loeffler, CTO at WebPros: The Influence of AI in Hosting Jan Loeffler, CTO at WebPros: The Influence of AI in Hosting
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Can the grid cope with AI’s growing appetite?

Because the AI Power Council gathers, the query hanging within the air is: how will…

June 30, 2025

What game companies can learn from AI analysis of 1.5M gamer conversations | Creativ Company

Creativ Company is rising right now as a brand new form of market intelligence firm.…

June 8, 2025

Laing O’Rourke achieves major milestone for Pure Data Centres in Abu Dhabi

The location, which can present 45MW of functionality total, is Pure DC’s first enterprise within…

March 17, 2025

‘AI Greenferencing’ Model Could Transform Data Centers with Wind Power

With AI workloads surging and information middle energy consumption climbing to unprecedented ranges, a staff…

July 12, 2025

LiquidStack unveils GigaModular Coolant Distribution Unit

LiquidStack has unveiled its all-new GigaModular™ CDU —the business’s first modular, scalable Coolant Distribution Unit…

June 3, 2025

You Might Also Like

Why most enterprise AI coding pilots underperform (Hint: It's not the model)
AI

Why most enterprise AI coding pilots underperform (Hint: It's not the model)

By saad
Newsweek: Building AI-resilience for the next era of information
AI

Newsweek: Building AI-resilience for the next era of information

By saad
Google’s new framework helps AI agents spend their compute and tool budget more wisely
AI

Google’s new framework helps AI agents spend their compute and tool budget more wisely

By saad
BBVA embeds AI into banking workflows using ChatGPT Enterprise
AI

BBVA embeds AI into banking workflows using ChatGPT Enterprise

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.