DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model
Yucheng Xing, Jinxing Yin, Xiaodong Liu

TL;DR
DANCER is a novel diffusion-based framework for realistic single-person dance synthesis that leverages condition enhancement modules and a new large-scale dataset to improve video quality and motion accuracy.
Contribution
The paper introduces DANCER, combining appearance and pose guidance modules with a large TikTok-3K dataset to advance dance video generation with diffusion models.
Findings
Outperforms state-of-the-art methods in dance synthesis quality
Effectively captures human motion and appearance details
Demonstrates superior realism and continuity in generated videos
Abstract
Recently, diffusion models have shown their impressive ability in visual generation tasks. Besides static images, more and more research attentions have been drawn to the generation of realistic videos. The video generation not only has a higher requirement for the quality, but also brings a challenge in ensuring the video continuity. Among all the video generation tasks, human-involved contents, such as human dancing, are even more difficult to generate due to the high degrees of freedom associated with human motions. In this paper, we propose a novel framework, named as DANCER (Dance ANimation via Condition Enhancement and Rendering with Diffusion Model), for realistic single-person dance synthesis based on the most recent stable video diffusion model. As the video generation is generally guided by a reference image and a video sequence, we introduce two important modules into our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · 3D Shape Modeling and Analysis
