Reverse Forward Curriculum Learning for Extreme Sample and Demonstration   Efficiency in Reinforcement Learning

Stone Tao; Arth Shukla; Tse-kai Chan; Hao Su

arXiv:2405.03379·cs.LG·May 7, 2024·1 cites

Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning

Stone Tao, Arth Shukla, Tse-kai Chan, Hao Su

PDF

Open Access 1 Repo

TL;DR

This paper introduces RFCL, a reinforcement learning method that combines reverse and forward curricula to efficiently leverage multiple demonstrations, significantly improving sample and demonstration efficiency in complex tasks.

Contribution

RFCL uniquely utilizes multiple demonstrations with per-demonstration reverse curricula, enhancing initial policy quality and accelerating learning in sparse reward environments.

Findings

01

RFCL outperforms state-of-the-art baselines in demonstration efficiency.

02

RFCL solves previously unsolvable high-precision tasks.

03

Significant reduction in environment interactions needed for complex tasks.

Abstract

Reinforcement learning (RL) presents a promising framework to learn policies through environment interaction, but often requires an infeasible amount of interaction data to solve complex tasks from sparse rewards. One direction includes augmenting RL with offline data demonstrating desired tasks, but past work often require a lot of high-quality demonstration data that is difficult to obtain, especially for domains such as robotics. Our approach consists of a reverse curriculum followed by a forward curriculum. Unique to our approach compared to past work is the ability to efficiently leverage more than one demonstration via a per-demonstration reverse curriculum generated via state resets. The result of our reverse curriculum is an initial policy that performs well on a narrow initial state distribution and helps overcome difficult exploration problems. A forward curriculum is then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stonet2000/rfcl
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications