InterReal: A Unified Physics-Based Imitation Framework for Learning Human-Object Interaction Skills

Dayang Liang; Yuhang Lin; Xinzhe Liu; Jiyuan Shi; Yunlong Liu; Chenjia Bai

arXiv:2603.07516·cs.RO·March 10, 2026

TL;DR

InterReal is a physics-based imitation learning framework that enables humanoid robots to learn and perform human-object interactions with high accuracy and robustness in real-world settings.

Contribution

The paper introduces a unified framework combining data augmentation, an automatic reward learner, and meta-policy guidance for improved human-object interaction learning.

Findings

01. Achieves superior tracking accuracy in HOI tasks
02. Attains higher success rates compared to recent baselines
03. Demonstrates effective real-world robot deployment

Abstract

Interaction is one of the core abilities of humanoid robots. However, most existing frameworks focus on non-interactive whole-body control, which limits their practical applicability. In this work, we develop InterReal, a unified physics-based imitation learning framework for Real-world human-object Interaction (HOI) control. InterReal enables humanoid robots to track HOI reference motions, facilitating the learning of fine-grained interactive skills and their deployment in real-world settings. Within this framework, we first introduce an HOI motion data augmentation scheme with hand-object contact constraints, and utilize the augmented motions to improve policy stability under object perturbations. Second, we propose an automatic reward learner to address the challenge of large-scale reward shaping. A meta-policy guided by critical tracking error metrics explores and allocates reward…
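
The abstract does not spell out how the contact-constrained augmentation works, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes a reference clip stored as per-frame 3D hand and object positions plus a per-frame contact flag, and it perturbs the object by one random offset while moving the hand by the same offset on contact frames, so the hand-object relative position is preserved wherever contact is flagged. The function name `augment_hoi_clip`, the data layout, and the `max_shift` parameter are all hypothetical.

```python
import random


def augment_hoi_clip(hand_pos, obj_pos, contact, max_shift=0.05, seed=None):
    """Return a perturbed copy of one HOI reference clip (hypothetical layout).

    hand_pos, obj_pos: lists of [x, y, z] positions, one entry per frame.
    contact: list of booleans, True on frames where the hand touches the object.
    max_shift: largest per-axis object displacement, in the clip's length unit.
    """
    rng = random.Random(seed)
    # One offset per clip: the whole object trajectory is translated rigidly.
    offset = [rng.uniform(-max_shift, max_shift) for _ in range(3)]

    new_obj = [[p + d for p, d in zip(frame, offset)] for frame in obj_pos]

    # Assumed contact constraint: on contact frames the hand follows the
    # object, so (object - hand) is unchanged; other frames are left as-is.
    new_hand = [
        [p + d for p, d in zip(frame, offset)] if in_contact else list(frame)
        for frame, in_contact in zip(hand_pos, contact)
    ]
    return new_hand, new_obj


if __name__ == "__main__":
    hand = [[0.0, 0.0, 0.0]] * 4
    obj = [[0.1, 0.0, 0.0]] * 4
    touching = [False, True, True, False]
    aug_hand, aug_obj = augment_hoi_clip(hand, obj, touching, seed=0)
    print(aug_hand)
    print(aug_obj)
```

A limitation of this sketch is that the hand target jumps at the frames where contact starts and ends; a practical scheme would presumably blend the offset in and out around those transitions.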

Taxonomy

Topics: Robot Manipulation and Learning · Human Pose and Action Recognition · Social Robot Interaction and HRI