A Self-Supervised Approach on Motion Calibration for Enhancing Physical Plausibility in Text-to-Motion
Gahyeon Shim, Soogeun Park, Hyemin Ahn

TL;DR
This paper presents DMC, a self-supervised post-hoc module that refines text-to-motion generated human motions, improving physical realism such as foot contact and reducing artifacts, without complex physical modeling.
Contribution
Introduction of DMC, a self-supervised, data-driven motion calibrator that enhances physical plausibility in text-to-motion generation models without requiring physical simulation.
Findings
Reduces FID score by 42.74% on T2M
Improves physical plausibility by reducing foot penetration by 33%
Enhances semantic consistency and motion realism
Abstract
Generating semantically aligned human motion from textual descriptions has made rapid progress, but ensuring both semantic and physical realism in motion remains a challenge. In this paper, we introduce the Distortion-aware Motion Calibrator (DMC), a post-hoc module that refines physically implausible motions (e.g., foot floating) while preserving semantic consistency with the original textual description. Rather than relying on complex physical modeling, we propose a self-supervised and data-driven approach, whereby DMC learns to obtain physically plausible motions when an intentionally distorted motion and the original textual descriptions are given as inputs. We evaluate DMC as a post-hoc module to improve motions obtained from various text-to-motion generation models and demonstrate its effectiveness in improving physical plausibility while enhancing semantic consistency. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
