Unsupervised Skill Discovery through Skill Regions Differentiation

Ting Xiao; Jiakun Zheng; Rushuai Yang; Kang Xu; Qiaosheng Zhang; Peng Liu; Chenjia Bai

arXiv:2506.14420·cs.LG·June 18, 2025

Unsupervised Skill Discovery through Skill Regions Differentiation

Ting Xiao, Jiakun Zheng, Rushuai Yang, Kang Xu, Qiaosheng Zhang, Peng Liu, Chenjia Bai

PDF

Open Access

TL;DR

This paper introduces a novel unsupervised skill discovery method that maximizes inter-skill state diversity using a conditional autoencoder, enabling effective exploration in high-dimensional spaces like images.

Contribution

It proposes a new skill discovery objective based on state density deviation and a conditional autoencoder for high-dimensional state exploration.

Findings

01

Learns meaningful skills in complex environments

02

Achieves superior downstream task performance

03

Effective in high-dimensional state spaces

Abstract

Unsupervised Reinforcement Learning (RL) aims to discover diverse behaviors that can accelerate the learning of downstream tasks. Previous methods typically focus on entropy-based exploration or empowerment-driven skill learning. However, entropy-based exploration struggles in large-scale state spaces (e.g., images), and empowerment-based methods with Mutual Information (MI) estimations have limitations in state exploration. To address these challenges, we propose a novel skill discovery objective that maximizes the deviation of the state density of one skill from the explored regions of other skills, encouraging inter-skill state diversity similar to the initial MI objective. For state-density estimation, we construct a novel conditional autoencoder with soft modularization for different skill policies in high-dimensional space. Meanwhile, to incentivize intra-skill exploration, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHigher Education Learning Practices · Educational Technology and Assessment

MethodsFocus