Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards
Irmak Guzey, Yinlong Dai, Georgy Savva, Raunaq Bhirangi, Lerrel Pinto

TL;DR
HuDOR is a novel method that leverages object-oriented rewards from human videos to enable fast, online fine-tuning of multi-fingered robot hands, bridging the gap between human demonstrations and robotic dexterity.
Contribution
This work introduces HuDOR, a reward computation technique that uses object trajectories to adapt policies from human videos to robot hands, overcoming morphology and visual differences.
Findings
HuDOR enables learning a task in about an hour of online interaction.
Achieves a 4x performance improvement over baseline methods.
Successfully transfers human demonstration videos to multi-fingered robot manipulation tasks.
Abstract
Training robots directly from human videos is an emerging area in robotics and computer vision. While there has been notable progress with two-fingered grippers, learning autonomous tasks for multi-fingered robot hands in this way remains challenging. A key reason for this difficulty is that a policy trained on human hands may not directly transfer to a robot hand due to morphology differences. In this work, we present HuDOR, a technique that enables online fine-tuning of policies by directly computing rewards from human videos. Importantly, this reward function is built using object-oriented trajectories derived from off-the-shelf point trackers, providing meaningful learning signals despite the morphology gap and visual differences between human and robot hands. Given a single video of a human solving a task, such as gently opening a music box, HuDOR enables our four-fingered Allegro…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Automated Systems · Robotic Path Planning Algorithms
