SynH2R: Synthesizing Hand-Object Motions for Learning Human-to-Robot Handovers
Sammy Christen, Lan Feng, Wei Yang, Yu-Wei Chao, Otmar, Hilliges, Jie Song

TL;DR
This paper presents a novel framework for generating synthetic human grasping motions to train robots for human-to-robot handovers, reducing reliance on costly real human motion data and enabling large-scale object handling.
Contribution
The authors introduce a hand-object synthesis method that generates plausible handover motions, allowing training with synthetic data for improved scalability and performance.
Findings
Synthetic data training achieves competitive results with real data-based methods.
The method scales to over 100 times more objects than previous approaches.
Models trained with synthetic data perform well on unseen objects and motions.
Abstract
Vision-based human-to-robot handover is an important and challenging task in human-robot interaction. Recent work has attempted to train robot policies by interacting with dynamic virtual humans in simulated environments, where the policies can later be transferred to the real world. However, a major bottleneck is the reliance on human motion capture data, which is expensive to acquire and difficult to scale to arbitrary objects and human grasping motions. In this paper, we introduce a framework that can generate plausible human grasping motions suitable for training the robot. To achieve this, we propose a hand-object synthesis method that is designed to generate handover-friendly motions similar to humans. This allows us to generate synthetic training and testing data with 100x more objects than previous work. In our experiments, we show that our method trained purely with synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Human Motion and Animation
