Bitcoin

Bitcoin

$119,050.67

BTC 1.30%

Ethereum

Ethereum

$2,984.67

ETH 1.51%

  • Login
  • Register
Metaverse Media Group
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
Metaverse Media Group

AI system StreamDiT generates livestream videos from text at 16 fps 512p

AI system StreamDiT generates livestream videos from text at 16 fps 512p

The Decoderby The Decoder
13 July 2025
A new AI system called StreamDiT can generate livestream videos from text descriptions, opening up new possibilities for gaming and interactive media. The article AI system StreamDiT generates livestream videos from text at 16 fps 512p appeared first on THE DECODER….

summary
Summary

A new AI system called StreamDiT can generate livestream videos from text descriptions, opening up new possibilities for gaming and interactive media.

Developed by researchers at Meta and the University of California, Berkeley, StreamDiT creates videos in real time at 16 frames per second using a single high-end GPU. The model, with 4 billion parameters, outputs videos at 512p resolution. Unlike previous methods that generate full video clips before playback, StreamDiT produces video streams live, frame by frame.

Video: Kodaira et al.

The team showcased various use cases. StreamDiT can generate minute-long videos on the fly, respond to interactive prompts, and even edit existing videos in real time. In one demo, a pig in a video was transformed into a cat while the background stayed the same.

THE DECODER Newsletter
The most important AI news straight to your inbox.
✓ Weekly
✓ Cancel at any time

Four frames: input video of a running pig (top) and output frames transformed into a cat via prompt (bottom) in a graffiti alley.
Using a text prompt, StreamDiT transforms a running pig in the input video into a cat in the output stream, demonstrating real-time prompt-based video editing. | Image: Kodaira et al.
Recommend our article

The system relies on a custom architecture built for speed. StreamDiT uses a moving buffer to process multiple frames simultaneously, working on the next frame while outputting the previous one. New frames start out noisy but are gradually refined until they are ready for display. According to the paper, the system takes about half a second to generate two frames, producing eight finished images after processing.

Schematic buffer division into K reference frames and N chunks; alongside this, auto-denoise steps with decreasing correlation values
StreamDiT divides its buffer into fixed reference frames and short chunks. An auto-sequence visualization shows image similarity decreasing (from green to red) as denoising progresses. | Image: Kodaira et al.

Training for versatility

The training process was designed to improve versatility. Instead of focusing on a single video creation method, the model was trained with several approaches, using 3,000 high-quality videos and a larger dataset of 2.6 million videos. Training took place on 128 Nvidia H100 GPUs. The researchers found that mixing chunk sizes from 1 to 16 frames produced the best results.

To achieve real-time performance, the team introduced an acceleration technique that cuts the number of required calculation steps from 128 to just 8, with minimal impact on image quality. The architecture is also optimized for efficiency: rather than having every image element interact with all others, information is exchanged only between local regions.

In head-to-head comparisons, StreamDiT outperformed existing methods like ReuseDiffuse and FIFO diffusion, especially for videos with a lot of movement. While other models tended to create static scenes, StreamDiT generated more dynamic and natural motion.

Human raters evaluated the system’s performance on fluidity of motion, completeness of animation, consistency across frames, and overall quality. In every category, StreamDiT came out on top when tested on eight-second, 512p videos.

Recommendation
Stacked bar charts: Percentage win rates of ours vs. ReuseDiffuse (left) and ours vs. FIFO (right) for four evaluation axes.
Human raters assessed motion naturalness, motion completeness, frame consistency, and overall experience. | Image: Kodaira et al.

Bigger model, better quality—but slower

The team also experimented with a much larger 30-billion-parameter model, which delivered even higher video quality, though it wasn’t fast enough for real-time use. The results suggest the approach can scale to larger systems.

Video: Kodaira et al.

Some limitations remain, including StreamDiT’s limited ability to “remember” earlier parts of a video and occasional visible transitions between sections. The researchers say they are working on solutions.

Other companies are also exploring real-time AI video generation. Odyssey, for example, recently introduced an autoregressive world model that adapts video frame by frame in response to user input, making interactive experiences more accessible.

Join our community
Join the DECODER community on Discord, Reddit or Twitter – we can’t wait to meet you.

Read the full article on The-Decoder.com
in AI
Reading Time: 4 mins read
0
0
22
VIEWS
Share on TwitterShare on Facebook

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now
ADVERTISEMENT

Related Posts

Grok 4 is not officially instructed to follow Musk’s views but often does on sensitive subjects
AI

xAI says it wants to fix Grok 4 because referencing Musk’s views is not right for a truth-seeking AI

11 hours ago
21
Elon Musk’s AI company xAI apologizes “deeply” for Grok’s “horrific behavior”
AI

Elon Musk’s AI company xAI apologizes “deeply” for Grok’s “horrific behavior”

11 hours ago
21
Researchers used 1,600 YouTube fail videos to show AI models struggle with surprises
AI

Researchers used 1,600 YouTube fail videos to show AI models struggle with surprises

13 hours ago
21

Comments

Please login to join discussion
ADVERTISEMENT

Latest News

  • All
  • Crypto
  • NFTs
  • Technology
  • Business
Record Bitcoin Prices Fail to Spark Search Frenzy, Google Trends Data Shows
Crypto

Record Bitcoin Prices Fail to Spark Search Frenzy, Google Trends Data Shows

Bitcoin.com News
by Bitcoin.com News
1 hour ago
21
Bitcoin Climbs to No. 6 Spot Among Global Market Giants; Closes in on Amazon
Crypto

Bitcoin Climbs to No. 6 Spot Among Global Market Giants; Closes in on Amazon

Bitcoin.com News
by Bitcoin.com News
3 hours ago
21
Dead Broke in Canada: Could Bitcoin–or Joining the US–Be the Answer?
Crypto

Dead Broke in Canada: Could Bitcoin–or Joining the US–Be the Answer?

Bitcoin.com News
by Bitcoin.com News
3 hours ago
21
Record Highs, Record-Low Selling Pressure: Cryptoquant Documents Unusual Market Calm
Crypto

Record Highs, Record-Low Selling Pressure: Cryptoquant Documents Unusual Market Calm

Bitcoin.com News
by Bitcoin.com News
5 hours ago
21
BTC’s $118K Rally Wipes out $1B in Shorts, Canadian Woman Sues Over Sim-Swap Scam, and More — Week in Review
Crypto

BTC’s $118K Rally Wipes out $1B in Shorts, Canadian Woman Sues Over Sim-Swap Scam, and More — Week in Review

Bitcoin.com News
by Bitcoin.com News
6 hours ago
21
Bitcoin Rocket Ship Blasts Past $119K as Bull Run Accelerates
Crypto

Bitcoin Rocket Ship Blasts Past $119K as Bull Run Accelerates

Bitcoin.com News
by Bitcoin.com News
6 hours ago
21
Load More
Next Post
Bitcoin Price Watch: High-Stakes Consolidation Could Define Q3 Trend

Bitcoin Price Watch: High-Stakes Consolidation Could Define Q3 Trend

ADVERTISEMENT

Follow Us

Categories

  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
Subscribe to our Newsletter

© 2022 Metaverse Media Group – The Metaverse Mecca

Privacy and Cookie Policy | Sitemap

Welcome Back!

Sign In with Google
OR

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Sign Up with Google
OR

Fill the forms below to register

*By registering into our website, you agree to the Terms & Conditions and Privacy Policy.
All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
Bitcoin

Bitcoin

$119,050.67

BTC 1.30%

Ethereum

Ethereum

$2,984.67

ETH 1.51%

  • Login
  • Sign Up
This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.

Subscribe to our newsletter

Get the latest news & win monthly prizes

Subscribe to our newsletter

For the Latest News and Monthly Prize Giveaways

Join Now
Join Now