DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

Leander Diaz-Bone; Marco Bagatella; Jonas H\"ubotter; Andreas Krause

arXiv:2505.19850·cs.LG·October 21, 2025

DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

Leander Diaz-Bone, Marco Bagatella, Jonas H\"ubotter, Andreas Krause

PDF

Open Access 1 Repo

TL;DR

DISCOVER introduces an automated curriculum method for sparse-reward reinforcement learning that directs exploration towards relevant goals, significantly improving performance on complex, high-dimensional tasks without prior task-specific information.

Contribution

The paper presents DISCOVER, a novel goal-directed exploration method that leverages existing RL algorithms to efficiently solve long-horizon sparse-reward tasks by selecting relevant exploratory goals.

Findings

01

DISCOVER outperforms prior exploration methods in high-dimensional environments.

02

The method provides theoretical bounds on the time to achieve target tasks.

03

Directed goal selection enhances exploration efficiency in complex RL tasks.

Abstract

Sparse-reward reinforcement learning (RL) can model a wide range of highly complex tasks. Solving sparse-reward tasks is RL's core premise, requiring efficient exploration coupled with long-horizon credit assignment, and overcoming these challenges is key for building self-improving agents with superhuman ability. Prior work commonly explores with the objective of solving many sparse-reward tasks, making exploration of individual high-dimensional, long-horizon tasks intractable. We argue that solving such challenging tasks requires solving simpler tasks that are relevant to the target task, i.e., whose achieval will teach the agent skills required for solving the target task. We demonstrate that this sense of direction, necessary for effective exploration, can be extracted from existing RL algorithms, without leveraging any prior information. To this end, we propose a method for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leanderdiazbone/discover
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBehavioral and Psychological Studies · Software Engineering Research · Advanced Software Engineering Methodologies