Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs

Himanshu Sahni; Toby Buckley; Pieter Abbeel; Ilya Kuzovkin

arXiv:1901.11529 · cs.AI · October 31, 2019 · 5 cites

TL;DR

This paper improves the sample efficiency of reinforcement learning in visual environments by using a GAN to hallucinate successful-looking visual trajectories and combining them with HER, showing marked improvement on 3D navigation tasks and a simulated robotics application.

Contribution

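The paper introduces a method that alters agent observations with a generative model so that unsuccessful visual trajectories appear successful, then combines these hallucinated trajectories with HER to reduce sample complexity in sparse-reward visual RL tasks.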

Findings

01. Marked improvement over baselines in 3D navigation tasks
02. Effective use of GAN-generated visual trajectories
03. Enhanced sample efficiency in visual reinforcement learning

Abstract

Reinforcement Learning (RL) algorithms typically require millions of environment interactions to learn successful policies in sparse reward settings. Hindsight Experience Replay (HER) was introduced as a technique to increase sample efficiency by reimagining unsuccessful trajectories as successful ones by altering the originally intended goals. However, it cannot be directly applied to visual environments where goal states are often characterized by the presence of distinct visual features. In this work, we show how visual trajectories can be hallucinated to appear successful by altering agent observations using a generative model trained on relatively few snapshots of the goal. We then use this model in combination with HER to train RL agents in visual settings. We validate our approach on 3D navigation tasks and a simulated robotics application and show marked improvement over…
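
The goal-relabeling idea behind HER that the abstract describes can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a low-dimensional state (a grid cell), a "final state" relabeling strategy, and a 0 / -1 sparse reward convention, and the names `Transition`, `sparse_reward`, and `her_relabel_final` are hypothetical. In the visual setting the paper targets, the goal cannot simply be swapped in a state vector like this; the observations themselves would have to be altered by a generative model, which this sketch does not attempt.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

# Hypothetical low-dimensional state, e.g. a grid cell (row, column).
State = Tuple[int, int]


@dataclass(frozen=True)
class Transition:
    state: State
    action: int
    next_state: State
    goal: State
    reward: float


def sparse_reward(achieved: State, goal: State) -> float:
    """Assumed sparse reward: 0 when the goal is reached, -1 otherwise."""
    return 0.0 if achieved == goal else -1.0


def her_relabel_final(trajectory: List[Transition]) -> List[Transition]:
    """Return a copy of the trajectory whose goal is the state actually reached.

    Every transition gets the episode's final state as its goal, and its
    reward is recomputed against that substituted goal. The input list is
    left unchanged.
    """
    if not trajectory:
        return []
    new_goal = trajectory[-1].next_state
    return [
        replace(t, goal=new_goal, reward=sparse_reward(t.next_state, new_goal))
        for t in trajectory
    ]


if __name__ == "__main__":
    # A failed two-step episode: the intended goal (5, 5) is never reached.
    episode = [
        Transition((0, 0), 1, (0, 1), (5, 5), -1.0),
        Transition((0, 1), 1, (0, 2), (5, 5), -1.0),
    ]
    for t in her_relabel_final(episode):
        print(t)
```

Both the original and the relabeled transitions would typically be stored in the replay buffer, so the agent sees at least some rewarded experience even when the intended goal is never reached.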



Taxonomy

Topics: Reinforcement Learning in Robotics · Embodied and Extended Cognition · Mental Health Research Topics

Methods: Experience Replay