Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
Jiaheng Hu, Zizhao Wang, Peter Stone, Roberto Mart\'in-Mart\'in

TL;DR
DUSDi introduces a method for learning disentangled skills in reinforcement learning, enabling efficient reuse and chaining of skills for solving complex downstream tasks, outperforming previous methods.
Contribution
The paper proposes a novel mutual-information-based objective and value factorization approach for learning disentangled skills in an unsupervised manner.
Findings
Successfully learns disentangled skills in challenging environments.
Significantly outperforms previous skill discovery methods on downstream tasks.
Enables efficient skill chaining for hierarchical reinforcement learning.
Abstract
A hallmark of intelligent agents is the ability to learn reusable skills purely from unsupervised interaction with the environment. However, existing unsupervised skill discovery methods often learn entangled skills where one skill variable simultaneously influences many entities in the environment, making downstream skill chaining extremely challenging. We propose Disentangled Unsupervised Skill Discovery (DUSDi), a method for learning disentangled skills that can be efficiently reused to solve downstream tasks. DUSDi decomposes skills into disentangled components, where each skill component only affects one factor of the state space. Importantly, these skill components can be concurrently composed to generate low-level actions, and efficiently chained to tackle downstream tasks through hierarchical Reinforcement Learning. DUSDi defines a novel mutual-information-based objective to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Data Classification
MethodsSparse Evolutionary Training
