DexMV: Imitation Learning for Dexterous Manipulation from Human Videos

Yuzhe Qin; Yueh-Hua Wu; Shaowei Liu; Hanwen Jiang; Ruihan Yang; Yang; Fu; Xiaolong Wang

arXiv:2108.05877·cs.LG·July 7, 2022

DexMV: Imitation Learning for Dexterous Manipulation from Human Videos

Yuzhe Qin, Yueh-Hua Wu, Shaowei Liu, Hanwen Jiang, Ruihan Yang, Yang, Fu, Xiaolong Wang

PDF

Open Access 1 Repo

TL;DR

DexMV introduces a platform and pipeline for imitation learning from human videos, enabling robots to perform complex dexterous manipulation tasks by translating human demonstrations into robot actions.

Contribution

The paper presents a novel system combining simulation, vision, and imitation learning to improve dexterous manipulation in robots from human video demonstrations.

Findings

01

Demonstrations significantly improve robot learning performance.

02

The pipeline enables solving complex tasks beyond reinforcement learning capabilities.

03

Benchmarking shows effectiveness of various imitation learning algorithms.

Abstract

While significant progress has been made on understanding hand-object interactions in computer vision, it is still very challenging for robots to perform complex dexterous manipulation. In this paper, we propose a new platform and pipeline DexMV (Dexterous Manipulation from Videos) for imitation learning. We design a platform with: (i) a simulation system for complex dexterous manipulation tasks with a multi-finger robot hand and (ii) a computer vision system to record large-scale demonstrations of a human hand conducting the same tasks. In our novel pipeline, we extract 3D hand and object poses from videos, and propose a novel demonstration translation method to convert human motion to robot demonstrations. We then apply and benchmark multiple imitation learning algorithms with the demonstrations. We show that the demonstrations can indeed improve robot learning by a large margin and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yzqin/dexmv-sim
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Hand Gesture Recognition Systems