Skeleton2Stage: Reward-Guided Fine-Tuning for Physically Plausible Dance Generation
Jidong Jia, Youjian Zhang, Huan Fu, Dacheng Tao

TL;DR
This paper introduces Skeleton2Stage, a reinforcement learning approach that fine-tunes dance generation models to produce physically plausible motions by incorporating physics-based rewards and addressing issues like freezing motions.
Contribution
It proposes a novel reward-guided fine-tuning method that improves physical plausibility in dance synthesis, incorporating anti-freezing rewards to preserve motion dynamics.
Findings
Significantly reduces body self-penetration and foot-ground contact anomalies.
Produces more realistic and aesthetically pleasing dance motions.
Enhances the physical plausibility of generated dances across multiple datasets.
Abstract
Despite advances in dance generation, most methods are trained in the skeletal domain and ignore mesh-level physical constraints. As a result, motions that look plausible as joint trajectories often exhibit body self-penetration and Foot-Ground Contact (FGC) anomalies when visualized with a human body mesh, reducing the aesthetic appeal of generated dances and limiting their real-world applications. We address this skeleton-to-mesh gap by deriving physics-based rewards from the body mesh and applying Reinforcement Learning Fine-Tuning (RLFT) to steer the diffusion model toward physically plausible motion synthesis under mesh visualization. Our reward design combines (i) an imitation reward that measures a motion's general plausibility by its imitability in a physical simulator (penalizing penetration and foot skating), and (ii) a Foot-Ground Deviation (FGD) reward with test-time FGD…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · 3D Shape Modeling and Analysis · Human Pose and Action Recognition
