Hindsight Generative Adversarial Imitation Learning

Naijun Liu; Tao Lu; Yinghao Cai; Boyao Li; and Shuo Wang

arXiv:1903.07854·cs.LG·March 20, 2019·6 cites

Hindsight Generative Adversarial Imitation Learning

Naijun Liu, Tao Lu, Yinghao Cai, Boyao Li, and Shuo Wang

PDF

Open Access

TL;DR

This paper introduces HGAIL, a novel imitation learning algorithm that eliminates the need for expert demonstrations by combining hindsight with GAIL, enabling effective policy training in data-scarce scenarios.

Contribution

The paper proposes HGAIL, a new imitation learning method that removes the requirement for expert demonstrations by integrating hindsight into the GAIL framework.

Findings

01

HGAIL achieves comparable performance to traditional imitation learning methods.

02

The method inherently incorporates curriculum learning, aiding policy development.

03

Experiments validate HGAIL's effectiveness in demonstration-free scenarios.

Abstract

Compared to reinforcement learning, imitation learning (IL) is a powerful paradigm for training agents to learn control policies efficiently from expert demonstrations. However, in most cases, obtaining demonstration data is costly and laborious, which poses a significant challenge in some scenarios. A promising alternative is to train agent learning skills via imitation learning without expert demonstrations, which, to some extent, would extremely expand imitation learning areas. To achieve such expectation, in this paper, we propose Hindsight Generative Adversarial Imitation Learning (HGAIL) algorithm, with the aim of achieving imitation learning satisfying no need of demonstrations. Combining hindsight idea with the generative adversarial imitation learning (GAIL) framework, we realize implementing imitation learning successfully in cases of expert demonstration data are not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Human Pose and Action Recognition