Non-Adversarial Imitation Learning and its Connections to Adversarial   Methods

Oleg Arenz; Gerhard Neumann

arXiv:2008.03525·cs.LG·August 11, 2020·6 cites

Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

Oleg Arenz, Gerhard Neumann

PDF

Open Access 1 Repo

TL;DR

This paper introduces a non-adversarial framework for imitation learning that simplifies existing adversarial methods like AIRL, improves convergence guarantees, and demonstrates superior performance in offline robot learning tasks.

Contribution

It proposes a non-adversarial imitation learning framework, unifies AIRL within it, and develops new algorithms with better convergence and offline learning capabilities.

Findings

01

Non-adversarial framework simplifies AIRL derivations.

02

Proposed offline imitation learning method outperforms behavioral cloning and ValueDice.

03

Framework provides stronger convergence guarantees for imitation learning algorithms.

Abstract

Many modern methods for imitation learning and inverse reinforcement learning, such as GAIL or AIRL, are based on an adversarial formulation. These methods apply GANs to match the expert's distribution over states and actions with the implicit state-action distribution induced by the agent's policy. However, by framing imitation learning as a saddle point problem, adversarial methods can suffer from unstable optimization, and convergence can only be shown for small policy updates. We address these problems by proposing a framework for non-adversarial imitation learning. The resulting algorithms are similar to their adversarial counterparts and, thus, provide insights for adversarial imitation learning methods. Most notably, we show that AIRL is an instance of our non-adversarial formulation, which enables us to greatly simplify its derivations and obtain stronger convergence guarantees.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

OlegArenz/O-NAIL
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Robot Manipulation and Learning

MethodsGenerative Adversarial Imitation Learning