Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward   Long-Horizon Goal-Conditioned Reinforcement Learning

Lisheng Wu; Ke Chen

arXiv:2210.16058·cs.LG·December 20, 2023·1 cites

Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning

Lisheng Wu, Ke Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a goal exploration method using pre-trained skills and entropy optimization to improve exploration efficiency in sparse-reward, long-horizon goal-conditioned reinforcement learning tasks.

Contribution

It proposes a novel entropy-based goal exploration objective combined with skill learning from environment patterns to enhance GCRL performance.

Findings

01

Significantly improves exploration efficiency in benchmark tasks.

02

Maintains or enhances baseline GCRL performance.

03

Demonstrates effectiveness across various sparse-reward, long-horizon environments.

Abstract

Reinforcement learning (RL) often struggles to accomplish a sparse-reward long-horizon task in a complex environment. Goal-conditioned reinforcement learning (GCRL) has been employed to tackle this difficult problem via a curriculum of easy-to-reach sub-goals. In GCRL, exploring novel sub-goals is essential for the agent to ultimately find the pathway to the desired goal. How to explore novel sub-goals efficiently is one of the most challenging issues in GCRL. Several goal exploration methods have been proposed to address this issue but still struggle to find the desired goals efficiently. In this paper, we propose a novel learning objective by optimizing the entropy of both achieved and new goals to be explored for more efficient goal exploration in sub-goal selection based GCRL. To optimize this objective, we first explore and exploit the frequently occurring goal-transition patterns…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

geaps/geaps
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics