DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model

Yucheng Xing; Jinxing Yin; Xiaodong Liu

arXiv:2510.27169·cs.CV·November 3, 2025

DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model

Yucheng Xing, Jinxing Yin, Xiaodong Liu

PDF

Open Access

TL;DR

DANCER is a novel diffusion-based framework for realistic single-person dance synthesis that leverages condition enhancement modules and a new large-scale dataset to improve video quality and motion accuracy.

Contribution

The paper introduces DANCER, combining appearance and pose guidance modules with a large TikTok-3K dataset to advance dance video generation with diffusion models.

Findings

01

Outperforms state-of-the-art methods in dance synthesis quality

02

Effectively captures human motion and appearance details

03

Demonstrates superior realism and continuity in generated videos

Abstract

Recently, diffusion models have shown their impressive ability in visual generation tasks. Besides static images, more and more research attentions have been drawn to the generation of realistic videos. The video generation not only has a higher requirement for the quality, but also brings a challenge in ensuring the video continuity. Among all the video generation tasks, human-involved contents, such as human dancing, are even more difficult to generate due to the high degrees of freedom associated with human motions. In this paper, we propose a novel framework, named as DANCER (Dance ANimation via Condition Enhancement and Rendering with Diffusion Model), for realistic single-person dance synthesis based on the most recent stable video diffusion model. As the video generation is generally guided by a reference image and a video sequence, we introduce two important modules into our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · 3D Shape Modeling and Analysis