Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human   Demonstrations

Xiaogang Jia; Denis Blessing; Xinkai Jiang; Moritz Reuss; Atalay; Donat; Rudolf Lioutikov; Gerhard Neumann

arXiv:2402.14606·cs.RO·February 23, 2024·1 cites

Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

Xiaogang Jia, Denis Blessing, Xinkai Jiang, Moritz Reuss, Atalay, Donat, Rudolf Lioutikov, Gerhard Neumann

PDF

Open Access

TL;DR

This paper introduces a new benchmark and datasets for evaluating imitation learning algorithms on their ability to learn and reproduce diverse, multi-modal human behaviors in complex, multi-object manipulation tasks, addressing a key challenge in the field.

Contribution

The work presents simulation environments, datasets, and metrics specifically designed to evaluate and quantify a model's capacity to learn diverse human behaviors in imitation learning.

Findings

01

State-of-the-art methods show limited ability to capture behavior diversity.

02

The proposed metrics provide meaningful insights into model robustness and versatility.

03

Benchmark results highlight gaps and future directions for imitation learning algorithms.

Abstract

Imitation learning with human data has demonstrated remarkable success in teaching robots in a wide range of skills. However, the inherent diversity in human behavior leads to the emergence of multi-modal data distributions, thereby presenting a formidable challenge for existing imitation learning algorithms. Quantifying a model's capacity to capture and replicate this diversity effectively is still an open problem. In this work, we introduce simulation benchmark environments and the corresponding Datasets with Diverse human Demonstrations for Imitation Learning (D3IL), designed explicitly to evaluate a model's ability to learn multi-modal behavior. Our environments are designed to involve multiple sub-tasks that need to be solved, consider manipulation of multiple objects which increases the diversity of the behavior and can only be solved by policies that rely on closed loop sensory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications