MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Shuyuan Tu, Qi Dai, Zihao Zhang, Sicheng Xie, Zhi-Qi Cheng, Chong Luo,, Xintong Han, Zuxuan Wu, Yu-Gang Jiang

TL;DR
MotionFollower is a lightweight diffusion-based video editing model that effectively modifies motion while preserving appearance and background, using novel score guidance and signal controllers for efficient and high-quality results.
Contribution
The paper introduces MotionFollower, a lightweight score-guided diffusion model with novel signal controllers for motion editing that outperforms existing models in efficiency and quality.
Findings
Achieves ~80% GPU memory reduction compared to MotionEditor.
Demonstrates superior motion editing performance both qualitatively and quantitatively.
Supports large camera movements and complex actions.
Abstract
Despite impressive advancements in diffusion-based video editing models in altering video attributes, there has been limited exploration into modifying motion information while preserving the original protagonist's appearance and background. In this paper, we propose MotionFollower, a lightweight score-guided diffusion model for video motion editing. To introduce conditional controls to the denoising process, MotionFollower leverages two of our proposed lightweight signal controllers, one for poses and the other for appearances, both of which consist of convolution blocks without involving heavy attention calculations. Further, we design a score guidance principle based on a two-branch architecture, including the reconstruction and editing branches, which significantly enhance the modeling capability of texture details and complicated backgrounds. Concretely, we enforce several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging
MethodsConvolution · Diffusion
