Modelling Latent Dynamics of StyleGAN using Neural ODEs
Weihao Xia, Yujiu Yang, Jing-Hao Xue

TL;DR
This paper introduces a novel approach to model video dynamics by learning continuous latent trajectories using Neural ODEs, enabling high-quality frame interpolation and consistent video editing with reduced computation.
Contribution
It presents a new method that models latent space trajectories with Neural ODEs for improved video interpolation and editing, outperforming existing techniques.
Findings
Achieves state-of-the-art performance in video interpolation.
Enables infinite frame interpolation with less computation.
Maintains temporal consistency in video editing.
Abstract
In this paper, we propose to model the video dynamics by learning the trajectory of independently inverted latent codes from GANs. The entire sequence is seen as discrete-time observations of a continuous trajectory of the initial latent code, by considering each latent code as a moving particle and the latent space as a high-dimensional dynamic system. The latent codes representing different frames are therefore reformulated as state transitions of the initial frame, which can be modeled by neural ordinary differential equations. The learned continuous trajectory allows us to perform infinite frame interpolation and consistent video manipulation. The latter task is reintroduced for video editing with the advantage of requiring the core operations to be applied to the first frame only while maintaining temporal consistency across all frames. Extensive experiments demonstrate that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Music and Audio Processing
