On Combining Expert Demonstrations in Imitation Learning via Optimal Transport

Ilana Sebag; Samuel Cohen; Marc Peter Deisenroth

arXiv:2307.10810·cs.LG·July 21, 2023

Open Access

TL;DR

This paper combines multiple expert demonstrations in imitation learning with a multi-marginal optimal transport distance rather than by concatenating trajectories, which is problematic when the demonstrations are multi-modal.

Contribution

The paper proposes a multi-marginal optimal transport distance that combines multiple, diverse expert state-trajectories in the OT sense, as an alternative to the standard practice of concatenating state(-action) trajectories.

Findings

01. The proposed method outperforms standard concatenation in diverse environments.

02. It provides a more meaningful geometric average of multiple demonstrations.

03. Efficiency is validated on OpenAI Gym control tasks.

Abstract

Imitation learning (IL) seeks to teach agents specific tasks through expert demonstrations. One of the key approaches to IL is to define a distance between agent and expert and to find an agent policy that minimizes that distance. Optimal transport methods have been widely used in imitation learning as they provide ways to measure meaningful distances between agent and expert trajectories. However, the problem of how to optimally combine multiple expert demonstrations has not been widely studied. The standard method is to simply concatenate state (-action) trajectories, which is problematic when trajectories are multi-modal. We propose an alternative method that uses a multi-marginal optimal transport distance and enables the combination of multiple and diverse state-trajectories in the OT sense, providing a more sensible geometric average of the demonstrations. Our approach enables an…
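To make the contrast between concatenation and an OT-based average concrete, the following is a minimal sketch for the special case of two experts. It is not the paper's algorithm: it assumes both demonstrations are equally sized sets of 2-D states with uniform weights, uses a squared Euclidean cost, and solves the transport problem by brute force over permutations. The names sq_dist, optimal_matching, ot_midpoint_average, expert_a, and expert_b are hypothetical and chosen only for illustration.

```python
from itertools import permutations


def sq_dist(p, q):
    """Squared Euclidean distance between two states."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def optimal_matching(xs, ys):
    """Brute-force optimal transport between two uniform, equally sized point sets.

    For uniform weights and equal sizes, an optimal plan is a permutation,
    so enumerating permutations is exact (only feasible for tiny sets).
    Returns the best permutation and the mean transport cost.
    """
    n = len(xs)
    best_cost, best_perm = float("inf"), None
    for perm in permutations(range(n)):
        cost = sum(sq_dist(xs[i], ys[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_cost, best_perm = cost, perm
    return best_perm, best_cost / n


def ot_midpoint_average(xs, ys):
    """Average two demonstrations by taking midpoints of optimally matched states."""
    perm, _ = optimal_matching(xs, ys)
    return [
        tuple((a + b) / 2 for a, b in zip(xs[i], ys[j]))
        for i, j in enumerate(perm)
    ]


# Two hypothetical experts: one bends upward, the other bends downward.
# Expert B's states are listed in reverse order to show that the matching,
# not the index order, decides which states are averaged together.
expert_a = [(0.0, 0.0), (1.0, 0.5), (2.0, 0.5), (3.0, 0.0)]
expert_b = [(3.0, 0.0), (2.0, -0.5), (1.0, -0.5), (0.0, 0.0)]

# Standard approach: pool all states, keeping both modes side by side.
concatenated = expert_a + expert_b

# OT-based average: one combined demonstration of the same length as each expert.
averaged = ot_midpoint_average(expert_a, expert_b)

print(len(concatenated), averaged)
```

In this toy case the concatenated target contains eight states spread over both modes, while the OT-based average contains four states lying between the two experts. With more than two experts, the pairwise matching would have to be replaced by a multi-marginal coupling over all demonstrations, which this sketch does not attempt.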

Taxonomy

Topics: Reinforcement Learning in Robotics · Robot Manipulation and Learning · Multimodal Machine Learning Applications