Thursday, 22 Jan 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Meta enters AI video wars with powerful Movie Gen model
AI

Meta enters AI video wars with powerful Movie Gen model

Last updated: October 6, 2024 2:14 am
Published October 6, 2024
Share
Meta enters AI video wars with powerful Movie Gen model
SHARE

Be a part of our day by day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Study Extra


Meta founder and CEO Mark Zuckerberg, who constructed the corporate atop of its hit social community Fb, completed this week sturdy, posting a video of himself doing a leg press train on a machine on the health club on his private Instagram (a social community Fb acquired in 2012).

Besides, within the video, the leg press machine transforms right into a neon cyberpunk model, an Historic Roman model, and a gold flaming model as effectively.

Because it turned out, Zuck was doing extra than simply exercising: he was utilizing the video to announce Movie Gen, Meta’s new household of generative multimodal AI fashions that may make each video and audio from textual content prompts, and permit customers to customise their very own movies, including particular results, props, costumes and altering choose parts merely by way of textual content steering, as Zuck did in his video.

The fashions seem like extraordinarily highly effective, permitting customers to vary solely chosen parts of a video clip moderately than “re-roll” or regenerate the whole factor, much like Pika’s spot modifying on older fashions, but with longer clip technology and sound inbuilt.

Meta’s exams, outlined in a technical paper on the mannequin household launched right now, present that it outperforms the main rivals within the area together with Runway Gen 3, Luma Dream Machine, OpenAI Sora and Kling 1.5 on many viewers rankings of various attributes comparable to consistency and “naturalness” of movement.

Meta has positioned Film Gen as a instrument for each on a regular basis customers trying to improve their digital storytelling in addition to skilled video creators and editors, even Hollywood filmmakers.

Film Gen represents Meta’s newest step ahead in generative AI expertise, combining video and audio capabilities inside a single system.

Specificially, Film Gen consists of 4 fashions:

1. Film Gen Video – a 30B parameter text-to-video technology mannequin

2. Film Gen Audio – a 13B parameter video-to-audio technology mannequin

3. Personalised Film Gen Video – a model of Film Gen Video post-trained to generate customized movies primarily based on an individual’s face

4. Film Gen Edit – a mannequin with a novel post-training process for exact video modifying

These fashions allow the creation of life like, customized HD movies of as much as 16 seconds at 16 FPS, together with 48kHz audio, and supply video modifying capabilities.

See also  Phantom data centers: What they are (or aren't) and why they're hampering the true promise of AI

Designed to deal with duties starting from customized video creation to stylish video modifying and high-quality audio technology, Film Gen leverages highly effective AI fashions to reinforce customers’ inventive choices.

Key options of the Film Gen suite embrace:

• Video Technology: With Film Gen, customers can produce high-definition (HD) movies by merely getting into textual content prompts. These movies might be rendered at 1080p decision, as much as 16 seconds lengthy, and are supported by a 30 billion-parameter transformer mannequin. The AI’s capability to handle detailed prompts permits it to deal with numerous features of video creation, together with digicam movement, object interactions, and environmental physics.

• Personalised Movies: Film Gen affords an thrilling customized video characteristic, the place customers can add a picture of themselves or others to be featured inside AI-generated movies. The mannequin can adapt to numerous prompts whereas sustaining the identification of the person, making it helpful for personalized content material creation.

• Exact Video Enhancing: The Film Gen suite additionally contains superior video modifying capabilities that enable customers to switch particular parts inside a video. This mannequin can alter localized features, like objects or colours, in addition to world adjustments, comparable to background swaps, all primarily based on easy textual content directions.

• Audio Technology: Along with video capabilities, Film Gen additionally incorporates a 13 billion-parameter audio technology mannequin. This characteristic allows the technology of sound results, ambient music, and synchronized audio that aligns seamlessly with visible content material. Customers can create Foley sounds (sound results amplifying but solidifying actual life noises like cloth ruffling and footsteps echoing), instrumental music, and different audio parts as much as 45 seconds lengthy. Meta posted an instance video with Foley sounds under (flip sound as much as hear it):

Skilled on billions of movies on-line

Film Gen is the newest development in Meta’s ongoing AI analysis efforts. To coach the fashions, Meta says it relied upon “web scale picture, video, and audio information,” particularly, 100 million movies and 1 billion photos from which it “learns concerning the visible world by ‘watching’ movies,” in keeping with the technical paper.

Nevertheless, Meta didn’t specify if the info was licensed within the paper or public area, or if it merely scraped it as many different AI mannequin makers have — resulting in criticism from artists and video creators comparable to YouTuber Marques Brownlee (MKBHD) — and, within the case of AI video mannequin supplier Runway, a class-action copyright infringement swimsuit by creators (nonetheless transferring by way of the courts). As such, one can count on Meta to face speedy criticism for its information sources.

See also  New o1 model of LLM at OpenAI could change hardware market

The authorized and moral questions concerning the coaching apart, Meta is clearly positioning the Film Gen creation course of as novel, utilizing a mixture of typical diffusion mannequin coaching (used generally in video and audio AI) alongside giant language mannequin (LLM) coaching and a brand new approach known as “Movement Matching,” the latter of which depends on modeling adjustments in a dataset’s distribution over time.

At every step, the mannequin learns to foretell the speed at which samples ought to “transfer” towards the goal distribution. Movement Matching differs from commonplace diffusion-based fashions in key methods:

• Zero Terminal Sign-to-Noise Ratio (SNR): Not like standard diffusion fashions, which require particular noise schedules to take care of a zero terminal SNR, Movement Matching inherently ensures zero terminal SNR with out extra changes. This gives robustness in opposition to the selection of noise schedules, contributing to extra constant and higher-quality video outputs  .

• Effectivity in Coaching and Inference: Movement Matching is discovered to be extra environment friendly each when it comes to coaching and inference in comparison with diffusion fashions. It affords flexibility when it comes to the kind of noise schedules used and exhibits improved efficiency throughout a spread of mannequin sizes. This strategy has additionally demonstrated higher alignment with human analysis outcomes.

The Film Gen system’s coaching course of focuses on maximizing flexibility and high quality for each video and audio technology. It depends on two principal fashions, every with intensive coaching and fine-tuning procedures:

• Film Gen Video Mannequin: This mannequin has 30 billion parameters and begins with fundamental text-to-image technology. It then progresses to text-to-video, producing movies as much as 16 seconds lengthy in HD high quality. The coaching course of entails a big dataset of movies and pictures, permitting the mannequin to know advanced visible ideas like movement, interactions, and digicam dynamics. To reinforce the mannequin’s capabilities, they fine-tuned it on a curated set of high-quality movies with textual content captions, which improved the realism and precision of its outputs. The staff additional expanded the mannequin’s flexibility by coaching it to deal with customized content material and modifying instructions.

See also  Meta plans to build $800 million, next-generation data center in Montgomery, Alabama

• Film Gen Audio Mannequin: With 13 billion parameters, this mannequin generates high-quality audio that syncs with visible parts within the video. The coaching set included over one million hours of audio, which allowed the mannequin to select up on each bodily and psychological connections between sound and visuals. They enhanced this mannequin by way of supervised fine-tuning, utilizing chosen high-quality audio and textual content pairs. This course of helped it generate life like ambient sounds, synced sound results, and mood-aligned background music for various video scenes.

It follows earlier initiatives like Make-A-Scene and the Llama Picture fashions, which centered on high-quality picture and animation technology.

This launch marks the third main milestone in Meta’s generative AI journey and underscores the corporate’s dedication to pushing the boundaries of media creation instruments.

Launching on Insta in 2025

Set to debut on Instagram in 2025, Film Gen is poised to make superior video creation extra accessible to the platform’s wide selection of customers.

Whereas the fashions are at the moment in a analysis section, Meta has expressed optimism that Film Gen will empower customers to supply compelling content material with ease.

Because the product continues to develop, Meta intends to collaborate with creators and filmmakers to refine Film Gen’s options and guarantee it meets person wants.

Meta’s long-term imaginative and prescient for Film Gen displays a broader aim of democratizing entry to stylish video modifying instruments. Whereas the suite affords appreciable potential, Meta acknowledges that generative AI instruments like Film Gen are supposed to improve, not substitute, the work {of professional} artists and animators.

As Meta prepares to convey Film Gen to market, the corporate stays centered on refining the expertise and addressing any present limitations. It plans additional optimizations aimed toward enhancing inference time and scaling up the mannequin’s capabilities. Meta has additionally hinted at potential future purposes, comparable to creating personalized animated greetings or quick movies fully pushed by person enter.

The discharge of Film Gen may sign a brand new period for content material creation on Meta’s platforms, with Instagram customers among the many first to expertise this progressive instrument. Because the expertise evolves, Film Gen may change into an important a part of Meta’s ecosystem and that of creators — professional and indie alike.


Source link
TAGGED: enters, Gen, Meta, Model, Movie, powerful, Video, wars
Share This Article
Twitter Email Copy Link Print
Previous Article Middle East Israel magnifying glass security US modifies AI chip export rules, boosting Middle East access
Next Article Legrand acquires ZPE Systems - Data Centre Review Eaton & CTS Nordics open NordicEPOD factory
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

A machine using ultrasound and AI can gauge the fattiness of a tuna fish

A show reveals tuna sushi units being offered at a market stall in Tokyo, Japan,…

April 21, 2025

Sendsprint Acquires Nobel Financial

Sendsprint, a Totowa, New Jersey-based worldwide fee and remittance startup, has acquired Nobel Monetary Inc., a…

August 9, 2024

Bechtel Breaks Ground on $4B Sodium-Cooled Nuclear Reactor

Reston, Virginia-based Bechtel not too long ago broke floor in Kemmerer, Wyoming, on an superior nuclear…

June 28, 2024

Roblox breaks ground on data center in Brazil for early 2026

At Gamescom Latam, Roblox introduced it has damaged floor on a brand new information heart in…

May 2, 2025

ABM Respiratory Care Closes $14.8M Series B Funding

ABM Respiratory Care, an Eagan, MN-based medical expertise firm creating airway clearance and air flow…

June 18, 2025

You Might Also Like

OpenCog Hyperon and AGI: Beyond large language models
AI

OpenCog Hyperon and AGI: Beyond large language models

By saad
The quiet work behind Citi’s 4,000-person internal AI rollout
AI

The quiet work behind Citi’s 4,000-person internal AI rollout

By saad
Balancing AI cost efficiency with data sovereignty
AI

Balancing AI cost efficiency with data sovereignty

By saad
Claude Code costs up to $200 a month. Goose does the same thing for free.
AI

Claude Code costs up to $200 a month. Goose does the same thing for free.

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.