Disentangled Unsupervised Skill Discovery for Efficient Hierarchical   Reinforcement Learning

Jiaheng Hu; Zizhao Wang; Peter Stone; Roberto Mart\'in-Mart\'in

arXiv:2410.11251·cs.LG·October 16, 2024

Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning

Jiaheng Hu, Zizhao Wang, Peter Stone, Roberto Mart\'in-Mart\'in

PDF

Open Access 1 Video

TL;DR

DUSDi introduces a method for learning disentangled skills in reinforcement learning, enabling efficient reuse and chaining of skills for solving complex downstream tasks, outperforming previous methods.

Contribution

The paper proposes a novel mutual-information-based objective and value factorization approach for learning disentangled skills in an unsupervised manner.

Findings

01

Successfully learns disentangled skills in challenging environments.

02

Significantly outperforms previous skill discovery methods on downstream tasks.

03

Enables efficient skill chaining for hierarchical reinforcement learning.

Abstract

A hallmark of intelligent agents is the ability to learn reusable skills purely from unsupervised interaction with the environment. However, existing unsupervised skill discovery methods often learn entangled skills where one skill variable simultaneously influences many entities in the environment, making downstream skill chaining extremely challenging. We propose Disentangled Unsupervised Skill Discovery (DUSDi), a method for learning disentangled skills that can be efficiently reused to solve downstream tasks. DUSDi decomposes skills into disentangled components, where each skill component only affects one factor of the state space. Importantly, these skill components can be concurrently composed to generate low-level actions, and efficiently chained to tackle downstream tasks through hierarchical Reinforcement Learning. DUSDi defines a novel mutual-information-based objective to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Data Classification

MethodsSparse Evolutionary Training