Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned   Reinforcement Learning

Lisheng Wu; Ke Chen

arXiv:2404.12999·cs.LG·April 22, 2024

Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning

Lisheng Wu, Ke Chen

PDF

Open Access

TL;DR

This paper introduces GEASD, a framework that improves exploration in goal-conditioned reinforcement learning by adaptively capturing environmental structural patterns, leading to more efficient and generalizable goal exploration.

Contribution

The paper presents a novel adaptive skill distribution method that enhances exploration efficiency and generalization in goal-conditioned reinforcement learning.

Findings

01

Significant improvement in exploration efficiency with GEASD.

02

Robust generalization to unseen tasks with similar structures.

03

Enhanced goal-spreading behavior through adaptive skill distribution.

Abstract

Exploration efficiency poses a significant challenge in goal-conditioned reinforcement learning (GCRL) tasks, particularly those with long horizons and sparse rewards. A primary limitation to exploration efficiency is the agent's inability to leverage environmental structural patterns. In this study, we introduce a novel framework, GEASD, designed to capture these patterns through an adaptive skill distribution during the learning process. This distribution optimizes the local entropy of achieved goals within a contextual horizon, enhancing goal-spreading behaviors and facilitating deep exploration in states containing familiar structural patterns. Our experiments reveal marked improvements in exploration efficiency using the adaptive skill distribution compared to a uniform skill distribution. Additionally, the learned skill distribution demonstrates robust generalization capabilities,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations