Automating Curriculum Learning for Reinforcement Learning using a   Skill-Based Bayesian Network

Vincent Hsiao; Mark Roberts; Laura M. Hiatt; George Konidaris; Dana; Nau

arXiv:2502.15662·cs.AI·February 24, 2025

Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network

Vincent Hsiao, Mark Roberts, Laura M. Hiatt, George Konidaris, Dana, Nau

PDF

TL;DR

This paper presents SEBNs, a probabilistic model that automates curriculum generation in reinforcement learning by predicting skill success and selecting tasks to improve training efficiency across diverse environments.

Contribution

Introduction of SEBNs, a novel Bayesian network model that predicts policy performance and guides automated curriculum learning in reinforcement learning tasks.

Findings

01

SEBN-based curricula outperform baseline methods in various environments.

02

The approach reduces training time and improves policy success rates.

03

Effective across gridworld, continuous control, and robotics simulations.

Abstract

A major challenge for reinforcement learning is automatically generating curricula to reduce training time or improve performance in some target task. We introduce SEBNs (Skill-Environment Bayesian Networks) which model a probabilistic relationship between a set of skills, a set of goals that relate to the reward structure, and a set of environment features to predict policy performance on (possibly unseen) tasks. We develop an algorithm that uses the inferred estimates of agent success from SEBN to weigh the possible next tasks by expected improvement. We evaluate the benefit of the resulting curriculum on three environments: a discrete gridworld, continuous control, and simulated robotics. The results show that curricula constructed using SEBN frequently outperform other baselines.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training