Skeleton2Stage: Reward-Guided Fine-Tuning for Physically Plausible Dance Generation

Jidong Jia; Youjian Zhang; Huan Fu; Dacheng Tao

arXiv:2602.13778·cs.CV·February 17, 2026

Skeleton2Stage: Reward-Guided Fine-Tuning for Physically Plausible Dance Generation

Jidong Jia, Youjian Zhang, Huan Fu, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces Skeleton2Stage, a reinforcement learning approach that fine-tunes dance generation models to produce physically plausible motions by incorporating physics-based rewards and addressing issues like freezing motions.

Contribution

It proposes a novel reward-guided fine-tuning method that improves physical plausibility in dance synthesis, incorporating anti-freezing rewards to preserve motion dynamics.

Findings

01

Significantly reduces body self-penetration and foot-ground contact anomalies.

02

Produces more realistic and aesthetically pleasing dance motions.

03

Enhances the physical plausibility of generated dances across multiple datasets.

Abstract

Despite advances in dance generation, most methods are trained in the skeletal domain and ignore mesh-level physical constraints. As a result, motions that look plausible as joint trajectories often exhibit body self-penetration and Foot-Ground Contact (FGC) anomalies when visualized with a human body mesh, reducing the aesthetic appeal of generated dances and limiting their real-world applications. We address this skeleton-to-mesh gap by deriving physics-based rewards from the body mesh and applying Reinforcement Learning Fine-Tuning (RLFT) to steer the diffusion model toward physically plausible motion synthesis under mesh visualization. Our reward design combines (i) an imitation reward that measures a motion's general plausibility by its imitability in a physical simulator (penalizing penetration and foot skating), and (ii) a Foot-Ground Deviation (FGD) reward with test-time FGD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · 3D Shape Modeling and Analysis · Human Pose and Action Recognition