DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing; Menghan Xia; Yong Zhang; Haoxin Chen; Wangbo Yu; Hanyuan; Liu; Xintao Wang; Tien-Tsin Wong; Ying Shan

arXiv:2310.12190·cs.CV·November 28, 2023·6 cites

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang, Haoxin Chen, Wangbo Yu, Hanyuan, Liu, Xintao Wang, Tien-Tsin Wong, Ying Shan

PDF

Open Access 2 Repos 9 Models

TL;DR

DynamiCrafter introduces a novel method for animating open-domain images by leveraging text-to-video diffusion models and image guidance, producing natural and convincing animated videos from static images.

Contribution

The paper presents a new approach that combines diffusion priors with image guidance to animate diverse images beyond traditional domain-specific methods.

Findings

01

Produces visually convincing animations

02

Achieves higher conformity to input images

03

Outperforms existing animation techniques

Abstract

Animating a still image offers an engaging visual experience. Traditional image animation techniques mainly focus on animating natural scenes with stochastic dynamics (e.g. clouds and fluid) or domain-specific motions (e.g. human hair or body motions), and thus limits their applicability to more general visual content. To overcome this limitation, we explore the synthesis of dynamic content for open-domain images, converting them into animated videos. The key idea is to utilize the motion prior of text-to-video diffusion models by incorporating the image into the generative process as guidance. Given an image, we first project it into a text-aligned rich context representation space using a query transformer, which facilitates the video model to digest the image content in a compatible fashion. However, some visual details still struggle to be preserved in the resultant videos. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · Computer Graphics and Visualization Techniques

MethodsFocus · Diffusion