AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models   without Specific Tuning

Yuwei Guo; Ceyuan Yang; Anyi Rao; Zhengyang Liang; Yaohui Wang; Yu; Qiao; Maneesh Agrawala; Dahua Lin; Bo Dai

arXiv:2307.04725·cs.CV·February 9, 2024·84 cites

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu, Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai

PDF

Open Access 5 Repos 5 Models 1 Video

TL;DR

AnimateDiff introduces a versatile framework that enables the animation of personalized text-to-image diffusion models without needing model-specific tuning, by learning transferable motion priors from videos.

Contribution

The paper presents a plug-and-play motion module and MotionLoRA fine-tuning technique that facilitate animation of personalized T2I models with minimal additional training.

Findings

01

Generates smooth, high-quality animations from personalized T2I models.

02

The motion module is transferable across models from the same base.

03

MotionLoRA efficiently adapts to new motion patterns with low cost.

Abstract

With the advance of text-to-image (T2I) diffusion models (e.g., Stable Diffusion) and corresponding personalization techniques such as DreamBooth and LoRA, everyone can manifest their imagination into high-quality images at an affordable cost. However, adding motion dynamics to existing high-quality personalized T2Is and enabling them to generate animations remains an open challenge. In this paper, we present AnimateDiff, a practical framework for animating personalized T2I models without requiring model-specific tuning. At the core of our framework is a plug-and-play motion module that can be trained once and seamlessly integrated into any personalized T2Is originating from the same base T2I. Through our proposed training strategy, the motion module effectively learns transferable motion priors from real-world videos. Once trained, the motion module can be inserted into a personalized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning· slideslive

Taxonomy

TopicsImage Retrieval and Classification Techniques · Human Motion and Animation · Music and Audio Processing

MethodsDiffusion · Balanced Selection