Autonomous Curriculum Design via Relative Entropy Based Task   Modifications

Muhammed Yusuf Satici; Jianxun Wang; David L. Roberts

arXiv:2502.21166·cs.LG·March 3, 2025

Autonomous Curriculum Design via Relative Entropy Based Task Modifications

Muhammed Yusuf Satici, Jianxun Wang, David L. Roberts

PDF

TL;DR

This paper introduces an autonomous curriculum design method that uses relative entropy to select high-uncertainty tasks, improving learning efficiency without human intervention.

Contribution

It presents a novel uncertainty-based curriculum design algorithm with theoretical convergence guarantees, outperforming existing methods and supporting both autonomous and teacher-guided learning.

Findings

01

Outperforms random curriculum and direct target task learning

02

Provides theoretical guarantees for convergence

03

Supports hybrid autonomous and teacher-guided curriculum design

Abstract

Curriculum learning is a training method in which an agent is first trained on a curriculum of relatively simple tasks related to a target task in an effort to shorten the time required to train on the target task. Autonomous curriculum design involves the design of such curriculum with no reliance on human knowledge and/or expertise. Finding an efficient and effective way of autonomously designing curricula remains an open problem. We propose a novel approach for automatically designing curricula by leveraging the learner's uncertainty to select curricula tasks. Our approach measures the uncertainty in the learner's policy using relative entropy, and guides the agent to states of high uncertainty to facilitate learning. Our algorithm supports the generation of autonomous curricula in a self-assessed manner by leveraging the learner's past and current policies but it also allows the use…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.