Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?

Ofir Nachum; Haoran Tang; Xingyu Lu; Shixiang Gu; Honglak Lee; Sergey; Levine

arXiv:1909.10618·cs.LG·January 1, 2020·51 cites

Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?

Ofir Nachum, Haoran Tang, Xingyu Lu, Shixiang Gu, Honglak Lee, Sergey, Levine

PDF

Open Access

TL;DR

This paper investigates why hierarchical reinforcement learning often outperforms standard RL, revealing that improved exploration is the main benefit, and introduces simpler exploration methods inspired by hierarchy.

Contribution

The study isolates exploration as the key advantage of hierarchical RL and proposes simpler exploration techniques that match hierarchical methods' performance.

Findings

01

Most benefits of hierarchical RL stem from enhanced exploration.

02

Hierarchical RL's advantages are not primarily due to easier policy learning.

03

Proposed exploration methods inspired by hierarchy perform competitively.

Abstract

Hierarchical reinforcement learning has demonstrated significant success at solving difficult reinforcement learning (RL) tasks. Previous works have motivated the use of hierarchy by appealing to a number of intuitive benefits, including learning over temporally extended transitions, exploring over temporally extended periods, and training and exploring in a more semantically meaningful action space, among others. However, in fully observed, Markovian settings, it is not immediately clear why hierarchical RL should provide benefits over standard "shallow" RL architectures. In this work, we isolate and evaluate the claimed benefits of hierarchical RL on a suite of tasks encompassing locomotion, navigation, and manipulation. Surprisingly, we find that most of the observed benefits of hierarchy can be attributed to improved exploration, as opposed to easier policy learning or imposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Supply Chain and Inventory Management