A Self-Supervised Approach on Motion Calibration for Enhancing Physical Plausibility in Text-to-Motion

Gahyeon Shim; Soogeun Park; Hyemin Ahn

arXiv:2602.18199·cs.CV·February 23, 2026

A Self-Supervised Approach on Motion Calibration for Enhancing Physical Plausibility in Text-to-Motion

Gahyeon Shim, Soogeun Park, Hyemin Ahn

PDF

Open Access

TL;DR

This paper presents DMC, a self-supervised post-hoc module that refines text-to-motion generated human motions, improving physical realism such as foot contact and reducing artifacts, without complex physical modeling.

Contribution

Introduction of DMC, a self-supervised, data-driven motion calibrator that enhances physical plausibility in text-to-motion generation models without requiring physical simulation.

Findings

01

Reduces FID score by 42.74% on T2M

02

Improves physical plausibility by reducing foot penetration by 33%

03

Enhances semantic consistency and motion realism

Abstract

Generating semantically aligned human motion from textual descriptions has made rapid progress, but ensuring both semantic and physical realism in motion remains a challenge. In this paper, we introduce the Distortion-aware Motion Calibrator (DMC), a post-hoc module that refines physically implausible motions (e.g., foot floating) while preserving semantic consistency with the original textual description. Rather than relying on complex physical modeling, we propose a self-supervised and data-driven approach, whereby DMC learns to obtain physically plausible motions when an intentionally distorted motion and the original textual descriptions are given as inputs. We evaluate DMC as a post-hoc module to improve motions obtained from various text-to-motion generation models and demonstrate its effectiveness in improving physical plausibility while enhancing semantic consistency. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis