Bridging the Human to Robot Dexterity Gap through Object-Oriented   Rewards

Irmak Guzey; Yinlong Dai; Georgy Savva; Raunaq Bhirangi; Lerrel Pinto

arXiv:2410.23289·cs.RO·October 31, 2024

Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards

Irmak Guzey, Yinlong Dai, Georgy Savva, Raunaq Bhirangi, Lerrel Pinto

PDF

Open Access

TL;DR

HuDOR is a novel method that leverages object-oriented rewards from human videos to enable fast, online fine-tuning of multi-fingered robot hands, bridging the gap between human demonstrations and robotic dexterity.

Contribution

This work introduces HuDOR, a reward computation technique that uses object trajectories to adapt policies from human videos to robot hands, overcoming morphology and visual differences.

Findings

01

HuDOR enables learning a task in about an hour of online interaction.

02

Achieves a 4x performance improvement over baseline methods.

03

Successfully transfers human demonstration videos to multi-fingered robot manipulation tasks.

Abstract

Training robots directly from human videos is an emerging area in robotics and computer vision. While there has been notable progress with two-fingered grippers, learning autonomous tasks for multi-fingered robot hands in this way remains challenging. A key reason for this difficulty is that a policy trained on human hands may not directly transfer to a robot hand due to morphology differences. In this work, we present HuDOR, a technique that enables online fine-tuning of policies by directly computing rewards from human videos. Importantly, this reward function is built using object-oriented trajectories derived from off-the-shelf point trackers, providing meaningful learning signals despite the morphology gap and visual differences between human and robot hands. Given a single video of a human solving a task, such as gently opening a music box, HuDOR enables our four-fingered Allegro…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotics and Automated Systems · Robotic Path Planning Algorithms