Learning from Guided Play: Improving Exploration for Adversarial   Imitation Learning with Simple Auxiliary Tasks

Trevor Ablett; Bryan Chan; Jonathan Kelly

arXiv:2301.00051·cs.LG·October 16, 2023

Learning from Guided Play: Improving Exploration for Adversarial Imitation Learning with Simple Auxiliary Tasks

Trevor Ablett, Bryan Chan, Jonathan Kelly

PDF

Open Access 1 Repo

TL;DR

This paper introduces Learning from Guided Play (LfGP), a framework that enhances exploration in adversarial imitation learning by incorporating auxiliary tasks and expert demonstrations, leading to better performance in robotic manipulation tasks.

Contribution

The paper proposes LfGP, a novel approach that leverages auxiliary tasks and expert data to improve exploration and sample efficiency in adversarial imitation learning.

Findings

01

LfGP outperforms AIL and behavior cloning in manipulation tasks.

02

LfGP is more sample-efficient than baseline methods.

03

Analysis reveals better exploration reduces local maxima issues.

Abstract

Adversarial imitation learning (AIL) has become a popular alternative to supervised imitation learning that reduces the distribution shift suffered by the latter. However, AIL requires effective exploration during an online reinforcement learning phase. In this work, we show that the standard, naive approach to exploration can manifest as a suboptimal local maximum if a policy learned with AIL sufficiently matches the expert distribution without fully learning the desired task. This can be particularly catastrophic for manipulation tasks, where the difference between an expert and a non-expert state-action pair is often subtle. We present Learning from Guided Play (LfGP), a framework in which we leverage expert demonstrations of multiple exploratory, auxiliary tasks in addition to a main task. The addition of these auxiliary tasks forces the agent to explore states and actions that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

utiasstars/lfgp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning