Wasserstein Adversarial Imitation Learning

Huang Xiao; Michael Herman; Joerg Wagner; Sebastian Ziesche; Jalal Etesami; Thai Hong Linh

arXiv:1906.08113 · cs.LG · June 20, 2019 · 35 citations

TL;DR

This paper introduces Wasserstein Adversarial Imitation Learning, a method that uses optimal transport theory to improve imitation learning by enabling smooth reward functions and high sample efficiency, demonstrated in robotic tasks.

Contribution

It proposes a novel imitation learning approach leveraging optimal transport and Kantorovich potentials, enhancing reward function flexibility and sample efficiency.

Findings

1. Outperforms the baselines in average cumulative reward
2. Achieves high sample efficiency, requiring only one expert demonstration
3. Effective in large-scale robotic applications

Abstract

Imitation Learning describes the problem of recovering an expert policy from demonstrations. While inverse reinforcement learning approaches are known to be very sample-efficient in terms of expert demonstrations, they usually require problem-dependent reward functions or a (task-)specific reward-function regularization. In this paper, we show a natural connection between inverse reinforcement learning approaches and Optimal Transport, which enables more general reward functions with desirable properties (e.g., smoothness). Based on our observation, we propose a novel approach called Wasserstein Adversarial Imitation Learning. Our approach considers the Kantorovich potentials as a reward function and further leverages regularized optimal transport to enable large-scale applications. In several robotic experiments, our approach outperforms the baselines in terms of average cumulative…
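To make the idea of Kantorovich potentials under regularized optimal transport more concrete, the following is a minimal sketch, not the authors' implementation. It assumes two small sets of samples (standing in for expert and policy state-action features), a squared Euclidean cost, and plain Sinkhorn iterations for the entropic regularization; the paper's actual training procedure is not shown in this excerpt. The function name `sinkhorn_potentials` and the parameters `eps` and `n_iters` are hypothetical.

```python
import numpy as np

def sinkhorn_potentials(x, y, eps=1.0, n_iters=500):
    """Entropic-regularized OT between two uniform empirical measures.

    Illustrative only: Sinkhorn iterations are an assumed stand-in for
    whatever regularized solver the paper uses.
    Returns dual potentials (f on x, g on y) and the regularized plan.
    """
    n, m = len(x), len(y)
    a = np.full(n, 1.0 / n)  # uniform weights on x samples
    b = np.full(m, 1.0 / m)  # uniform weights on y samples

    # Pairwise squared Euclidean cost (an assumed choice of ground cost).
    cost = ((x[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)
    kernel = np.exp(-cost / eps)

    u = np.ones(n)
    v = np.ones(m)
    for _ in range(n_iters):
        u = a / (kernel @ v)    # rescale to match row marginals
        v = b / (kernel.T @ u)  # rescale to match column marginals

    plan = u[:, None] * kernel * v[None, :]
    f = eps * np.log(u)  # potential evaluated on x samples
    g = eps * np.log(v)  # potential evaluated on y samples
    return f, g, plan

rng = np.random.default_rng(0)
expert = rng.normal(loc=0.0, scale=0.5, size=(32, 2))  # synthetic "expert" samples
policy = rng.normal(loc=1.0, scale=0.5, size=(48, 2))  # synthetic "policy" samples
f, g, plan = sinkhorn_potentials(expert, policy)
```

In an imitation-learning loop, one could treat a potential evaluated on policy samples (here `g`, up to a sign convention that is an assumption of this sketch) as a per-sample reward signal, which is the role the abstract assigns to the Kantorovich potentials.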

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Robot Manipulation and Learning · Reinforcement Learning in Robotics