Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations
Xiaogang Jia, Denis Blessing, Xinkai Jiang, Moritz Reuss, Atalay, Donat, Rudolf Lioutikov, Gerhard Neumann

TL;DR
This paper introduces a new benchmark and datasets for evaluating imitation learning algorithms on their ability to learn and reproduce diverse, multi-modal human behaviors in complex, multi-object manipulation tasks, addressing a key challenge in the field.
Contribution
The work presents simulation environments, datasets, and metrics specifically designed to evaluate and quantify a model's capacity to learn diverse human behaviors in imitation learning.
Findings
State-of-the-art methods show limited ability to capture behavior diversity.
The proposed metrics provide meaningful insights into model robustness and versatility.
Benchmark results highlight gaps and future directions for imitation learning algorithms.
Abstract
Imitation learning with human data has demonstrated remarkable success in teaching robots in a wide range of skills. However, the inherent diversity in human behavior leads to the emergence of multi-modal data distributions, thereby presenting a formidable challenge for existing imitation learning algorithms. Quantifying a model's capacity to capture and replicate this diversity effectively is still an open problem. In this work, we introduce simulation benchmark environments and the corresponding Datasets with Diverse human Demonstrations for Imitation Learning (D3IL), designed explicitly to evaluate a model's ability to learn multi-modal behavior. Our environments are designed to involve multiple sub-tasks that need to be solved, consider manipulation of multiple objects which increases the diversity of the behavior and can only be solved by policies that rely on closed loop sensory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications
