Meta Automatic Curriculum Learning
R\'emy Portelas, Cl\'ement Romac, Katja Hofmann, Pierre-Yves Oudeyer

TL;DR
This paper introduces Meta-ACL, a novel approach to automatic curriculum learning that generalizes across different RL learners, improving training efficiency in diverse, procedurally generated environments.
Contribution
It formalizes Meta-ACL for black-box RL learners and presents AGAIN, the first implementation demonstrating its advantages over traditional ACL methods.
Findings
Meta-ACL outperforms classical ACL in diverse environments.
AGAIN effectively adapts curriculum generation across different learners.
Procedurally generated parkour environments validate the approach.
Abstract
A major challenge in the Deep RL (DRL) community is to train agents able to generalize their control policy over situations never seen in training. Training on diverse tasks has been identified as a key ingredient for good generalization, which pushed researchers towards using rich procedural task generation systems controlled through complex continuous parameter spaces. In such complex task spaces, it is essential to rely on some form of Automatic Curriculum Learning (ACL) to adapt the task sampling distribution to a given learning agent, instead of randomly sampling tasks, as many could end up being either trivial or unfeasible. Since it is hard to get prior knowledge on such task spaces, many ACL algorithms explore the task space to detect progress niches over time, a costly tabula-rasa process that needs to be performed for each new learning agents, although they might have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Teaching and Learning Programming · Online Learning and Analytics
