Automatic Goal Generation for Reinforcement Learning Agents

Carlos Florensa; David Held; Xinyang Geng; Pieter Abbeel

arXiv:1705.06366·cs.LG·July 24, 2018·144 cites

Automatic Goal Generation for Reinforcement Learning Agents

Carlos Florensa, David Held, Xinyang Geng, Pieter Abbeel

PDF

Open Access 1 Repo

TL;DR

This paper introduces an adversarial training framework that enables reinforcement learning agents to automatically generate and learn a diverse set of tasks, creating an adaptive curriculum without prior environment knowledge.

Contribution

It proposes a novel generator network trained adversarially to produce appropriately challenging goals, facilitating automatic curriculum generation for multi-task reinforcement learning.

Findings

01

Agents can learn multiple tasks efficiently without prior environment knowledge.

02

The method effectively handles sparse reward scenarios.

03

Automatic task generation improves learning speed and diversity.

Abstract

Reinforcement learning is a powerful technique to train an agent to perform a task. However, an agent that is trained using reinforcement learning is only capable of achieving the single task that is specified via its reward function. Such an approach does not scale well to settings in which an agent needs to perform a diverse set of tasks, such as navigating to varying positions in a room or moving objects to varying locations. Instead, we propose a method that allows an agent to automatically discover the range of tasks that it is capable of performing. We use a generator network to propose tasks for the agent to try to achieve, specified as goal states. The generator network is optimized using adversarial training to produce tasks that are always at the appropriate level of difficulty for the agent. Our method thus automatically produces a curriculum of tasks for the agent to learn.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jeffchy/Artificial-Idiot
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Autonomous Vehicle Technology and Safety