Differential Motion Evolution for Fine-Grained Motion Deformation in Unsupervised Image Animation
Peirong Liu, Rui Wang, Xuefei Cao, Yipin Zhou, Ashish Shah, Ser-Nam, Lim

TL;DR
DiME introduces an unsupervised framework using differential motion evolution to improve image animation, especially under large motion discrepancies, by regularizing motion fields and leveraging multiple views for better generalization.
Contribution
The paper proposes DiME, a novel unsupervised motion transfer method that employs differential equations and multi-view modeling to enhance motion estimation and occlusion in image animation.
Findings
Outperforms state-of-the-art methods on 9 benchmarks.
Effectively captures large motion deformations.
Generalizes well to unseen objects.
Abstract
Image animation is the task of transferring the motion of a driving video to a given object in a source image. While great progress has recently been made in unsupervised motion transfer, requiring no labeled data or domain priors, many current unsupervised approaches still struggle to capture the motion deformations when large motion/view discrepancies occur between the source and driving domains. Under such conditions, there is simply not enough information to capture the motion field properly. We introduce DiME (Differential Motion Evolution), an end-to-end unsupervised motion transfer framework integrating differential refinement for motion estimation. Key findings are twofold: (1) by capturing the motion transfer with an ordinary differential equation (ODE), it helps to regularize the motion field, and (2) by utilizing the source image itself, we are able to inpaint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
