Reverse Curriculum Generation for Reinforcement Learning

Carlos Florensa; David Held; Markus Wulfmeier; Michael Zhang; Pieter Abbeel

arXiv:1707.05300 · cs.AI · July 24, 2018 · 140 cites

TL;DR

This paper introduces a reverse curriculum generation method for reinforcement learning that automatically creates a sequence of start states from which an agent can learn to reach a goal, improving training efficiency on complex tasks.

Contribution

The authors propose a reverse curriculum approach that requires no prior knowledge beyond a single state in which the task is achieved, and that makes training tractable on goal-oriented tasks with sparse rewards.

Findings

01. Successfully applied to simulated navigation tasks

02. Outperforms state-of-the-art reinforcement learning methods

03. Effective in fine-grained manipulation problems

Abstract

Many relevant tasks require an agent to reach a certain state, or to manipulate objects into a desired configuration. For example, we might want a robot to align and assemble a gear onto an axle or insert and turn a key in a lock. These goal-oriented tasks present a considerable challenge for reinforcement learning, since their natural reward function is sparse and prohibitive amounts of exploration are required to reach the goal and receive some learning signal. Past approaches tackle these problems by exploiting expert demonstrations or by manually designing a task-specific reward shaping function to guide the learning agent. Instead, we propose a method to learn these tasks without requiring any prior knowledge other than obtaining a single state in which the task is achieved. The robot is trained in reverse, gradually learning to reach the goal from a set of start states…
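
The "trained in reverse" idea can be illustrated with a toy sketch. This is not the authors' implementation: the 1-D state space, the `toy_success` stand-in for policy rollouts, the random-walk proposal step, and the success-rate band (`r_min`, `r_max`) are all assumptions made for illustration, since the abstract excerpt above is truncated before the details. The sketch starts from the single known goal state and repeatedly keeps start states that are neither trivially easy nor currently hopeless.

```python
import random

GOAL = 0        # the single known goal state (toy 1-D position)
MAX_POS = 20    # hypothetical edge of the toy state space

def toy_success(start, skill):
    """Hypothetical stand-in for policy rollouts: probability of reaching
    GOAL from `start`, dropping linearly once distance exceeds `skill`."""
    excess = abs(start - GOAL) - skill
    return max(0.0, min(1.0, 1.0 - excess / 3.0))

def propose_starts(seeds, n, rng, walk_len=4):
    """Take short random walks from current starts to find nearby states."""
    proposals = []
    for _ in range(n):
        s = rng.choice(seeds)
        for _step in range(walk_len):
            s = max(0, min(MAX_POS, s + rng.choice((-1, 1))))
        proposals.append(s)
    return proposals

def select_intermediate(starts, success_fn, r_min=0.1, r_max=0.9):
    """Keep starts whose estimated success rate lies in [r_min, r_max]."""
    return [s for s in starts if r_min <= success_fn(s) <= r_max]

def reverse_curriculum(iterations=30, seed=0):
    """Grow the set of start states outward from the goal."""
    rng = random.Random(seed)
    skill, starts, history = 1.0, [GOAL], []
    for _ in range(iterations):
        candidates = propose_starts(starts, 50, rng)
        good = select_intermediate(candidates, lambda s: toy_success(s, skill))
        if good:
            starts = good
            skill += 0.5  # toy "training" step; a real agent would update a policy
        history.append(sum(starts) / len(starts))
    return history

if __name__ == "__main__":
    # Mean distance of the current start states from the goal, per iteration.
    print(reverse_curriculum())
```

In this toy setting the mean start distance drifts away from the goal as the stand-in "skill" grows, which mirrors the abstract's description of gradually learning from start states near the goal. A real system would replace `toy_success` with success rates measured from actual rollouts.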

Taxonomy

Topics: Reinforcement Learning in Robotics · Robot Manipulation and Learning · Adversarial Robustness in Machine Learning