Automatic Goal Generation for Reinforcement Learning Agents
Carlos Florensa, David Held, Xinyang Geng, Pieter Abbeel

TL;DR
This paper introduces an adversarial training framework that enables reinforcement learning agents to automatically generate and learn a diverse set of tasks, creating an adaptive curriculum without prior environment knowledge.
Contribution
It proposes a novel generator network trained adversarially to produce appropriately challenging goals, facilitating automatic curriculum generation for multi-task reinforcement learning.
Findings
Agents can learn multiple tasks efficiently without prior environment knowledge.
The method effectively handles sparse reward scenarios.
Automatic task generation improves learning speed and diversity.
Abstract
Reinforcement learning is a powerful technique to train an agent to perform a task. However, an agent that is trained using reinforcement learning is only capable of achieving the single task that is specified via its reward function. Such an approach does not scale well to settings in which an agent needs to perform a diverse set of tasks, such as navigating to varying positions in a room or moving objects to varying locations. Instead, we propose a method that allows an agent to automatically discover the range of tasks that it is capable of performing. We use a generator network to propose tasks for the agent to try to achieve, specified as goal states. The generator network is optimized using adversarial training to produce tasks that are always at the appropriate level of difficulty for the agent. Our method thus automatically produces a curriculum of tasks for the agent to learn.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Autonomous Vehicle Technology and Safety
