Sample-Efficient Imitation Learning via Generative Adversarial Nets

Lionel Blond\'e; Alexandros Kalousis

arXiv:1809.02064·cs.LG·March 11, 2019·21 cites

Sample-Efficient Imitation Learning via Generative Adversarial Nets

Lionel Blond\'e, Alexandros Kalousis

PDF

Open Access 3 Repos

TL;DR

This paper introduces a highly sample-efficient imitation learning method based on adversarial training and off-policy actor-critic architecture, significantly reducing environment interactions needed for learning in continuous control tasks.

Contribution

The paper presents a novel framework that combines self-tuned adversarial surrogate rewards with off-policy learning to dramatically improve sample efficiency in imitation learning.

Findings

01

Achieves several orders of magnitude reduction in environment interactions

02

Maintains stable and effective imitation policies across various tasks

03

Simple to implement and applicable in continuous control environments

Abstract

GAIL is a recent successful imitation learning architecture that exploits the adversarial training procedure introduced in GANs. Albeit successful at generating behaviours similar to those demonstrated to the agent, GAIL suffers from a high sample complexity in the number of interactions it has to carry out in the environment in order to achieve satisfactory performance. We dramatically shrink the amount of interactions with the environment necessary to learn well-behaved imitation policies, by up to several orders of magnitude. Our framework, operating in the model-free regime, exhibits a significant increase in sample-efficiency over previous methods by simultaneously a) learning a self-tuned adversarially-trained surrogate reward and b) leveraging an off-policy actor-critic architecture. We show that our approach is simple to implement and that the learned agents remain remarkably…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Model Reduction and Neural Networks