DEALIO: Data-Efficient Adversarial Learning for Imitation from   Observation

Faraz Torabi; Garrett Warnell; Peter Stone

arXiv:2104.00163·cs.LG·April 2, 2021

DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation

Faraz Torabi, Garrett Warnell, Peter Stone

PDF

Open Access

TL;DR

This paper introduces DEALIO, a data-efficient adversarial imitation learning method that combines model-based RL techniques with adversarial methods for imitation from observation, significantly reducing sample complexity.

Contribution

It proposes integrating linear-quadratic regulator and path integral policy improvement into adversarial IfO, enhancing data efficiency without performance loss.

Findings

01

Achieves similar or better performance with fewer environment interactions.

02

Demonstrates effectiveness across four simulation domains.

03

Reduces sample complexity compared to existing methods.

Abstract

In imitation learning from observation IfO, a learning agent seeks to imitate a demonstrating agent using only observations of the demonstrated behavior without access to the control signals generated by the demonstrator. Recent methods based on adversarial imitation learning have led to state-of-the-art performance on IfO problems, but they typically suffer from high sample complexity due to a reliance on data-inefficient, model-free reinforcement learning algorithms. This issue makes them impractical to deploy in real-world settings, where gathering samples can incur high costs in terms of time, energy, and risk. In this work, we hypothesize that we can incorporate ideas from model-based reinforcement learning with adversarial methods for IfO in order to increase the data efficiency of these methods without sacrificing performance. Specifically, we consider time-varying linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)