What Matters in Learning from Offline Human Demonstrations for Robot   Manipulation

Ajay Mandlekar; Danfei Xu; Josiah Wong; Soroush Nasiriany; Chen Wang,; Rohun Kulkarni; Li Fei-Fei; Silvio Savarese; Yuke Zhu; Roberto; Mart\'in-Mart\'in

arXiv:2108.03298·cs.RO·September 28, 2021·70 cites

What Matters in Learning from Offline Human Demonstrations for Robot Manipulation

Ajay Mandlekar, Danfei Xu, Josiah Wong, Soroush Nasiriany, Chen Wang,, Rohun Kulkarni, Li Fei-Fei, Silvio Savarese, Yuke Zhu, Roberto, Mart\'in-Mart\'in

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper provides an extensive empirical analysis of offline learning algorithms for robot manipulation from human demonstrations, highlighting challenges, lessons, and opportunities, and offers open-source datasets and code for future research.

Contribution

It conducts a comprehensive study of six offline algorithms across multiple tasks, revealing key insights and providing open datasets and implementations to advance the field.

Findings

01

Sensitivity to algorithmic design choices

02

Dependence on demonstration quality

03

Variability based on training and evaluation criteria

Abstract

Imitating human demonstrations is a promising approach to endow robots with various manipulation capabilities. While recent advances have been made in imitation learning and batch (offline) reinforcement learning, a lack of open-source human datasets and reproducible learning methods make assessing the state of the field difficult. In this paper, we conduct an extensive study of six offline learning algorithms for robot manipulation on five simulated and three real-world multi-stage manipulation tasks of varying complexity, and with datasets of varying quality. Our study analyzes the most critical challenges when learning from offline human data for manipulation. Based on the study, we derive a series of lessons including the sensitivity to different algorithmic design choices, the dependence on the quality of the demonstrations, and the variability based on the stopping criteria due to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ARISE-Initiative/robomimic
pytorch

Datasets

robomimic/robomimic_datasets
dataset· 1.8k dl
1.8k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Adversarial Robustness in Machine Learning