A Probabilistic Interpretation of Self-Paced Learning with Applications   to Reinforcement Learning

Pascal Klink; Hany Abdulsamad; Boris Belousov; Carlo D'Eramo; Jan; Peters; Joni Pajarinen

arXiv:2102.13176·cs.LG·September 3, 2021

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

Pascal Klink, Hany Abdulsamad, Boris Belousov, Carlo D'Eramo, Jan, Peters, Joni Pajarinen

PDF

Open Access 1 Repo

TL;DR

This paper provides a theoretical framework for self-paced learning in reinforcement learning, showing how it can generate curricula that improve training efficiency and avoid local optima across various RL tasks.

Contribution

It formalizes self-paced learning as a distribution over tasks, offering a theoretical basis for automated curriculum generation in RL.

Findings

01

Training on the induced task distribution helps avoid poor local optima.

02

The approach improves performance in RL tasks with uninformative rewards.

03

It provides a theoretical understanding of curriculum generation in RL.

Abstract

Across machine learning, the use of curricula has shown strong empirical potential to improve learning from data by avoiding local optima of training objectives. For reinforcement learning (RL), curricula are especially interesting, as the underlying optimization has a strong tendency to get stuck in local optima due to the exploration-exploitation trade-off. Recently, a number of approaches for an automatic generation of curricula for RL have been shown to increase performance while requiring less expert knowledge compared to manually designed curricula. However, these approaches are seldomly investigated from a theoretical perspective, preventing a deeper understanding of their mechanics. In this paper, we present an approach for automated curriculum generation in RL with a clear theoretical underpinning. More precisely, we formalize the well-known self-paced learning paradigm as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

psclklnk/self-paced-rl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Machine Learning and Data Classification