Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations

Yinhuai Wang; Runyi Yu; Hok Wai Tsui; Xiaoyi Lin; Hui Zhang; Qihan Zhao; Ke Fan; Miao Li; Jie Song; Jingbo Wang; Qifeng Chen; Ping Tan

arXiv:2512.19583·cs.RO·December 23, 2025

Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations

Yinhuai Wang, Runyi Yu, Hok Wai Tsui, Xiaoyi Lin, Hui Zhang, Qihan Zhao, Ke Fan, Miao Li, Jie Song, Jingbo Wang, Qifeng Chen, Ping Tan

PDF

Open Access

TL;DR

This paper introduces a synthetic-data-based system for learning generalizable hand-object tracking controllers, enabling dexterous manipulation without human demonstrations and extending to diverse objects and hand types.

Contribution

The paper presents HOP and HOT, novel components for synthesizing trajectories and bridging synthetic-to-physical transfer, advancing manipulation control learning from synthetic data.

Findings

01

Enables tracking of complex, long-horizon sequences

02

Works across diverse object shapes and hand morphologies

03

Achieves scalable manipulation controllers from synthetic data

Abstract

We present a system for learning generalizable hand-object tracking controllers purely from synthetic data, without requiring any human demonstrations. Our approach makes two key contributions: (1) HOP, a Hand-Object Planner, which can synthesize diverse hand-object trajectories; and (2) HOT, a Hand-Object Tracker that bridges synthetic-to-physical transfer through reinforcement learning and interaction imitation learning, delivering a generalizable controller conditioned on target hand-object states. Our method extends to diverse object shapes and hand morphologies. Through extensive evaluations, we show that our approach enables dexterous hands to track challenging, long-horizon sequences including object re-arrangement and agile in-hand reorientation. These results represent a significant step toward scalable foundation controllers for manipulation that can learn entirely from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Motion and Animation · Human Pose and Action Recognition