MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Shenghai Yuan, Jinfa Huang, Yujun Shi, Yongqi Xu, Ruijie Zhu, Bin Lin,, Xinhua Cheng, Li Yuan, Jiebo Luo

TL;DR
MagicTime is a novel model that learns physical knowledge from time-lapse videos to generate high-quality, dynamic metamorphic videos, advancing the realism and variability in Text-to-Video synthesis.
Contribution
It introduces a metamorphic video generation framework with a MagicAdapter, Dynamic Frames Extraction, and Magic Text-Encoder, and provides a new dataset ChronoMagic for training and evaluation.
Findings
Outperforms existing models in generating realistic metamorphic videos
Effectively encodes physical knowledge from time-lapse videos
Demonstrates the potential of time-lapse video generation for physical world simulation
Abstract
Recent advances in Text-to-Video generation (T2V) have achieved remarkable success in synthesizing high-quality general videos from textual descriptions. A largely overlooked problem in T2V is that existing models have not adequately encoded physical knowledge of the real world, thus generated videos tend to have limited motion and poor variations. In this paper, we propose \textbf{MagicTime}, a metamorphic time-lapse video generation model, which learns real-world physics knowledge from time-lapse videos and implements metamorphic generation. First, we design a MagicAdapter scheme to decouple spatial and temporal training, encode more physical knowledge from metamorphic videos, and transform pre-trained T2V models to generate metamorphic videos. Second, we introduce a Dynamic Frames Extraction strategy to adapt to metamorphic time-lapse videos, which have a wider variation range and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Computer Graphics and Visualization Techniques · Artificial Intelligence in Games
