Robust Dancer: Long-term 3D Dance Synthesis Using Unpaired Data
Bin Feng, Tenglong Ao, Zequn Liu, Wei Ju, Libin Liu, Ming Zhang

TL;DR
Robust Dancer introduces a novel 3D dance synthesis system that generates realistic long-term dance motions from music using unpaired data, leveraging disentangled beat and style representations with a Transformer-based model and long-history attention.
Contribution
It is the first system to enable long-term 3D dance synthesis trained solely on unpaired data, overcoming limitations of data pairing and error accumulation.
Findings
Achieves comparable results to baseline methods without paired data.
Generates robust long-term dance sequences aligned with music.
Outperforms existing methods in unpaired data scenarios.
Abstract
How to automatically synthesize natural-looking dance movements based on a piece of music is an incrementally popular yet challenging task. Most existing data-driven approaches require hard-to-get paired training data and fail to generate long sequences of motion due to error accumulation of autoregressive structure. We present a novel 3D dance synthesis system that only needs unpaired data for training and could generate realistic long-term motions at the same time. For the unpaired data training, we explore the disentanglement of beat and style, and propose a Transformer-based model free of reliance upon paired data. For the synthesis of long-term motions, we devise a new long-history attention strategy. It first queries the long-history embedding through an attention computation and then explicitly fuses this embedding into the generation pipeline via multimodal adaptation gate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · Music and Audio Processing
Methodsfail
