Goal-Conditioned Reinforcement Learning with Disentanglement-based   Reachability Planning

Zhifeng Qian; Mingyu You; Hongjun Zhou; Xuanhui Xu; Bin He

arXiv:2307.10846·cs.RO·July 21, 2023

Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability Planning

Zhifeng Qian, Mingyu You, Hongjun Zhou, Xuanhui Xu, Bin He

PDF

Open Access

TL;DR

This paper introduces REPlan, a goal-conditioned reinforcement learning method with disentangled representations and reachability discrimination, significantly improving performance in temporally extended tasks with high-dimensional observations.

Contribution

The paper proposes a novel REPlan algorithm combining a disentangled representation module and a reachability discrimination module for efficient goal-reaching in high-dimensional spaces.

Findings

01

REPlan outperforms state-of-the-art methods in vision-based simulation tasks.

02

REPlan effectively learns compact representations for high-dimensional observations.

03

REPlan demonstrates successful real-world task performance.

Abstract

Goal-Conditioned Reinforcement Learning (GCRL) can enable agents to spontaneously set diverse goals to learn a set of skills. Despite the excellent works proposed in various fields, reaching distant goals in temporally extended tasks remains a challenge for GCRL. Current works tackled this problem by leveraging planning algorithms to plan intermediate subgoals to augment GCRL. Their methods need two crucial requirements: (i) a state representation space to search valid subgoals, and (ii) a distance function to measure the reachability of subgoals. However, they struggle to scale to high-dimensional state space due to their non-compact representations. Moreover, they cannot collect high-quality training data through standard GC policies, which results in an inaccurate distance function. Both affect the efficiency and performance of planning and policy learning. In the paper, we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning

MethodsDense Connections · Q-Learning · Convolution · Deep Q-Network · Random Ensemble Mixture