Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec, Yann Besse, Florent Teichteil-Konigsbuch,, Dennis G. Wilson, Emmanuel Rachelson

TL;DR
This paper introduces LEADS, a method for constructing diverse skills that uniformly cover the state space by leveraging successor state measures, enhancing exploration without relying on rewards or bonuses.
Contribution
The paper formalizes a new approach to skill diversity using successor state measures, improving exploration in reinforcement learning tasks.
Findings
LEADS effectively covers the entire state space in maze and robotic tasks.
The method promotes robust and efficient exploration without reward shaping.
It combines mutual information maximization with exploration bonuses.
Abstract
The ability to perform different skills can encourage agents to explore. In this work, we aim to construct a set of diverse skills which uniformly cover the state space. We propose a formalization of this search for diverse skills, building on a previous definition based on the mutual information between states and skills. We consider the distribution of states reached by a policy conditioned on each skill and leverage the successor state measure to maximize the difference between these skill distributions. We call this approach LEADS: Learning Diverse Skills through Successor States. We demonstrate our approach on a set of maze navigation and robotic control tasks which show that our method is capable of constructing a diverse set of skills which exhaustively cover the state space without relying on reward or exploration bonuses. Our findings demonstrate that this new formalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHigher Education Learning Practices · Competency Development and Evaluation · Human Resource Development and Performance Evaluation
MethodsSparse Evolutionary Training
