Task Phasing: Automated Curriculum Learning from Demonstrations

Vaibhav Bajaj; Guni Sharon; Peter Stone

arXiv:2210.10999 · cs.LG · March 29, 2023

TL;DR

This paper introduces a task phasing framework that combines learning from demonstrations with curriculum learning to improve reinforcement learning in sparse-reward environments, and reports better performance than state-of-the-art methods in three sparse-reward domains.

Contribution

The paper proposes a task phasing approach that automatically generates a curriculum from demonstrations, gives conditions under which the phasing approaches converge, and evaluates the approach empirically.

Findings

01 Outperforms state-of-the-art methods in three sparse-reward domains.

02 Provides convergence conditions for the proposed phasing approaches.

03 Gradually increases task complexity from a simple initial task up to the target task.

Abstract

Applying reinforcement learning (RL) to sparse reward domains is notoriously challenging due to insufficient guiding signals. Common RL techniques for addressing such domains include (1) learning from demonstrations and (2) curriculum learning. While these two approaches have been studied in detail, they have rarely been considered together. This paper aims to do so by introducing a principled task phasing approach that uses demonstrations to automatically generate a curriculum sequence. Using inverse RL from (suboptimal) demonstrations we define a simple initial task. Our task phasing approach then provides a framework to gradually increase the complexity of the task all the way to the target task, while retuning the RL agent in each phasing iteration. Two approaches for phasing are considered: (1) gradually increasing the proportion of time steps an RL agent is in control, and (2)…
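The first phasing approach named in the abstract, gradually increasing the proportion of time steps in which the RL agent is in control, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper or its repository: the names `learner_in_control`, `next_beta`, and `control_fraction`, the phasing parameter `beta`, the fixed increment `step`, and the rule that `beta` advances only once the mean return reaches a threshold.

```python
import random


def learner_in_control(beta: float, rng: random.Random) -> bool:
    """Return True if the learner, not the demonstrator, acts at this step.

    Assumption: control is sampled independently per time step, with
    probability `beta` that the learner acts.
    """
    return rng.random() < beta


def next_beta(beta: float, step: float, mean_return: float, threshold: float) -> float:
    """Advance the phasing parameter toward 1.0 (the target task).

    Assumption: the curriculum only advances when the retuned agent's
    mean return reaches `threshold`; otherwise the phase is repeated.
    """
    if mean_return >= threshold:
        return min(1.0, beta + step)
    return beta


def control_fraction(beta: float, horizon: int, seed: int = 0) -> float:
    """Empirical fraction of time steps in which the learner is in control."""
    rng = random.Random(seed)
    learner_steps = sum(learner_in_control(beta, rng) for _ in range(horizon))
    return learner_steps / horizon
```

With `beta = 0.0` the demonstrator acts at every step (the simple initial task), and with `beta = 1.0` the learner acts at every step (the target task); intermediate values give the tasks in between.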

Code & Models

Repositories

paranoidandroid96/task-phasing (PyTorch, official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Receptor Mechanisms and Signaling · Adversarial Robustness in Machine Learning