Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning

Tianren Zhang; Shangqi Guo; Tian Tan; Xiaolin Hu; Feng Chen

arXiv:2006.11485 · cs.LG · March 19, 2021 · 29 cites

TL;DR

This paper introduces an adjacency constraint for hierarchical reinforcement learning that limits high-level goal generation to nearby states, improving training efficiency and performance in various control tasks.

Contribution

The paper proposes an adjacency constraint on high-level subgoal generation, proves that the constraint preserves the optimal hierarchical policy in deterministic MDPs, and reports improved empirical results over existing HRL methods.

Findings

01. Enhanced training efficiency in HRL models

02. Improved performance on control benchmarks

03. Effective adjacency network implementation

Abstract

Goal-conditioned hierarchical reinforcement learning (HRL) is a promising approach for scaling up reinforcement learning (RL) techniques. However, it often suffers from training inefficiency as the action space of the high-level policy, i.e., the goal space, is often large. Searching in a large goal space poses difficulties for both high-level subgoal generation and low-level policy learning. In this paper, we show that this problem can be effectively alleviated by restricting the high-level action space from the whole goal space to a $k$-step adjacent region of the current state using an adjacency constraint. We theoretically prove that the proposed adjacency constraint preserves the optimal hierarchical policy in deterministic MDPs, and show that this constraint can be practically implemented by training an adjacency network that can discriminate between adjacent and non-adjacent subgoals.…
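To make the idea of a $k$-step adjacent region concrete, here is a minimal Python sketch on a toy deterministic grid world. It is not the authors' implementation. It assumes that "adjacent" means reachable within k environment transitions, and it computes this exactly by breadth-first search rather than with a learned adjacency network. The grid, the wall layout, and the function names (`grid_neighbors`, `k_step_adjacent_region`, `is_adjacent`) are hypothetical and introduced only for illustration.

```python
from collections import deque


def grid_neighbors(width, height, walls=frozenset()):
    """Build a neighbor function for a deterministic 4-connected grid (toy assumption)."""
    def neighbors(state):
        x, y = state
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nx, ny = x + dx, y + dy
            inside = 0 <= nx < width and 0 <= ny < height
            if inside and (nx, ny) not in walls:
                yield (nx, ny)
    return neighbors


def shortest_transition_distances(neighbors, start):
    """Breadth-first search: minimum number of transitions from start to each reachable state."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        for nxt in neighbors(state):
            if nxt not in dist:
                dist[nxt] = dist[state] + 1
                queue.append(nxt)
    return dist


def k_step_adjacent_region(neighbors, state, k):
    """All states reachable from `state` in at most k transitions."""
    dist = shortest_transition_distances(neighbors, state)
    return {s for s, d in dist.items() if d <= k}


def is_adjacent(neighbors, state, subgoal, k):
    """Exact adjacency check; a learned network would approximate this decision."""
    return subgoal in k_step_adjacent_region(neighbors, state, k)


# Toy 5x5 grid with a single wall cell next to the start state.
neighbors = grid_neighbors(5, 5, walls=frozenset({(1, 0)}))
region = k_step_adjacent_region(neighbors, (0, 0), k=2)

# (2, 0) is only two cells away in straight-line terms, but the wall forces a detour.
print(sorted(region))
print(is_adjacent(neighbors, (0, 0), (2, 0), k=2))
```

In this toy layout the cell `(2, 0)` lies close to the start in straight-line terms but is excluded from the 2-step region because the wall forces a longer path. This shows why an adjacency measure based on transitions can differ from a plain distance in goal space. In large or continuous state spaces an exhaustive search like this is not feasible, which is where a trained discriminator between adjacent and non-adjacent subgoals, as described in the abstract, would take its place.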

Code & Models

Repositories

trzhang0116/HRAC · PyTorch · Official

Videos

Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning · SlidesLive

Taxonomy

Topics: Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Evolutionary Algorithms and Applications