AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu, Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai

TL;DR
AnimateDiff introduces a versatile framework that enables the animation of personalized text-to-image diffusion models without needing model-specific tuning, by learning transferable motion priors from videos.
Contribution
The paper presents a plug-and-play motion module and MotionLoRA fine-tuning technique that facilitate animation of personalized T2I models with minimal additional training.
Findings
Generates smooth, high-quality animations from personalized T2I models.
The motion module is transferable across models from the same base.
MotionLoRA efficiently adapts to new motion patterns with low cost.
Abstract
With the advance of text-to-image (T2I) diffusion models (e.g., Stable Diffusion) and corresponding personalization techniques such as DreamBooth and LoRA, everyone can manifest their imagination into high-quality images at an affordable cost. However, adding motion dynamics to existing high-quality personalized T2Is and enabling them to generate animations remains an open challenge. In this paper, we present AnimateDiff, a practical framework for animating personalized T2I models without requiring model-specific tuning. At the core of our framework is a plug-and-play motion module that can be trained once and seamlessly integrated into any personalized T2Is originating from the same base T2I. Through our proposed training strategy, the motion module effectively learns transferable motion priors from real-world videos. Once trained, the motion module can be inserted into a personalized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsImage Retrieval and Classification Techniques · Human Motion and Animation · Music and Audio Processing
MethodsDiffusion · Balanced Selection
