InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions

Sirui Xu; Hung Yu Ling; Yu-Xiong Wang; Liang-Yan Gui

arXiv:2502.20390·cs.CV·February 3, 2026

InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions

Sirui Xu, Hung Yu Ling, Yu-Xiong Wang, Liang-Yan Gui

PDF

Open Access 1 Repo

TL;DR

InterMimic introduces a curriculum-based framework enabling a single policy to learn realistic, diverse, and generalizable human-object interactions from imperfect motion capture data, surpassing simple imitation through reinforcement learning.

Contribution

The paper presents a novel curriculum strategy and distillation approach for training a universal policy for complex human-object interactions from imperfect data.

Findings

01

Produces realistic and diverse HOIs across datasets

02

Generalizes zero-shot to new interactions

03

Integrates with kinematic generators for generative modeling

Abstract

Achieving realistic simulations of humans interacting with a wide range of objects has long been a fundamental goal. Extending physics-based motion imitation to complex human-object interactions (HOIs) is challenging due to intricate human-object coupling, variability in object geometries, and artifacts in motion capture data, such as inaccurate contacts and limited hand detail. We introduce InterMimic, a framework that enables a single policy to robustly learn from hours of imperfect MoCap data covering diverse full-body interactions with dynamic and varied objects. Our key insight is to employ a curriculum strategy -- perfect first, then scale up. We first train subject-specific teacher policies to mimic, retarget, and refine motion capture data. Next, we distill these teachers into a student policy, with the teachers acting as online experts providing direct supervision, as well as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Sirui-Xu/InterMimic
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Robot Manipulation and Learning · Human Pose and Action Recognition