RoboPaint: From Human Demonstration to Any Robot and Any View
Jiacheng Fan, Zhiyue Zhao, Yiqian Zhang, Chao Chen, Peide Wang, Hengdi Zhang, Zhengxue Cheng

TL;DR
This paper introduces RoboPaint, a scalable pipeline that converts human demonstrations into robot training data for dexterous manipulation, bypassing the need for direct robot teleoperation and enabling effective policy learning.
Contribution
The authors present a novel real-sim-real data collection and editing pipeline that transforms human demonstrations into environment-specific robot training data, including a tactile-aware retargeting method and photorealistic simulation rendering.
Findings
84% success rate in 10 manipulation tasks
80% average success rate of policies trained on generated data
Efficient and cost-effective alternative to teleoperation
Abstract
Acquiring large-scale, high-fidelity robot demonstration data remains a critical bottleneck for scaling Vision-Language-Action (VLA) models in dexterous manipulation. We propose a Real-Sim-Real data collection and data editing pipeline that transforms human demonstrations into robot-executable, environment-specific training data without direct robot teleoperation. Standardized data collection rooms are built to capture multimodal human demonstrations (synchronized 3 RGB-D videos, 11 RGB videos, 29-DoF glove joint angles, and 14-channel tactile signals). Based on these human demonstrations, we introduce a tactile-aware retargeting method that maps human hand states to robot dex-hand states via geometry and force-guided optimization. Then the retargeted robot trajectories are rendered in a photorealistic Isaac Sim environment to build robot training data. Real world experiments have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Social Robot Interaction and HRI
