What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar, Danfei Xu, Josiah Wong, Soroush Nasiriany, Chen Wang,, Rohun Kulkarni, Li Fei-Fei, Silvio Savarese, Yuke Zhu, Roberto, Mart\'in-Mart\'in

TL;DR
This paper provides an extensive empirical analysis of offline learning algorithms for robot manipulation from human demonstrations, highlighting challenges, lessons, and opportunities, and offers open-source datasets and code for future research.
Contribution
It conducts a comprehensive study of six offline algorithms across multiple tasks, revealing key insights and providing open datasets and implementations to advance the field.
Findings
Sensitivity to algorithmic design choices
Dependence on demonstration quality
Variability based on training and evaluation criteria
Abstract
Imitating human demonstrations is a promising approach to endow robots with various manipulation capabilities. While recent advances have been made in imitation learning and batch (offline) reinforcement learning, a lack of open-source human datasets and reproducible learning methods make assessing the state of the field difficult. In this paper, we conduct an extensive study of six offline learning algorithms for robot manipulation on five simulated and three real-world multi-stage manipulation tasks of varying complexity, and with datasets of varying quality. Our study analyzes the most critical challenges when learning from offline human data for manipulation. Based on the study, we derive a series of lessons including the sensitivity to different algorithmic design choices, the dependence on the quality of the demonstrations, and the variability based on the stopping criteria due to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Adversarial Robustness in Machine Learning
