DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone, Marco Bagatella, Jonas H\"ubotter, Andreas Krause

TL;DR
DISCOVER introduces an automated curriculum method for sparse-reward reinforcement learning that directs exploration towards relevant goals, significantly improving performance on complex, high-dimensional tasks without prior task-specific information.
Contribution
The paper presents DISCOVER, a novel goal-directed exploration method that leverages existing RL algorithms to efficiently solve long-horizon sparse-reward tasks by selecting relevant exploratory goals.
Findings
DISCOVER outperforms prior exploration methods in high-dimensional environments.
The method provides theoretical bounds on the time to achieve target tasks.
Directed goal selection enhances exploration efficiency in complex RL tasks.
Abstract
Sparse-reward reinforcement learning (RL) can model a wide range of highly complex tasks. Solving sparse-reward tasks is RL's core premise, requiring efficient exploration coupled with long-horizon credit assignment, and overcoming these challenges is key for building self-improving agents with superhuman ability. Prior work commonly explores with the objective of solving many sparse-reward tasks, making exploration of individual high-dimensional, long-horizon tasks intractable. We argue that solving such challenging tasks requires solving simpler tasks that are relevant to the target task, i.e., whose achieval will teach the agent skills required for solving the target task. We demonstrate that this sense of direction, necessary for effective exploration, can be extracted from existing RL algorithms, without leveraging any prior information. To this end, we propose a method for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBehavioral and Psychological Studies · Software Engineering Research · Advanced Software Engineering Methodologies
