Continuous-Time Video Generation via Learning Motion Dynamics with Neural ODE
Kangyeol Kim, Sunghyun Park, Junsoo Lee, Joonseok Lee, Sookyung Kim,, Jaegul Choo, Edward Choi

TL;DR
This paper introduces a novel continuous-time video generation method using Neural ODEs to model natural motion dynamics, enabling high-quality, flexible, and transferable video synthesis.
Contribution
It proposes a two-stage approach that separately models motion with Neural ODEs and appearance, allowing continuous motion learning and versatile video generation.
Findings
Outperforms recent baselines quantitatively
Enables dynamic frame rate manipulation
Allows motion transfer between datasets
Abstract
In order to perform unconditional video generation, we must learn the distribution of the real-world videos. In an effort to synthesize high-quality videos, various studies attempted to learn a mapping function between noise and videos, including recent efforts to separate motion distribution and appearance distribution. Previous methods, however, learn motion dynamics in discretized, fixed-interval timesteps, which is contrary to the continuous nature of motion of a physical body. In this paper, we propose a novel video generation approach that learns separate distributions for motion and appearance, the former modeled by neural ODE to learn natural motion dynamics. Specifically, we employ a two-stage approach where the first stage converts a noise vector to a sequence of keypoints in arbitrary frame rates, and the second stage synthesizes videos based on the given keypoints sequence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
