Learning and Exploiting Multiple Subgoals for Fast Exploration in   Hierarchical Reinforcement Learning

Libo Xing

arXiv:1905.05180·cs.LG·May 15, 2019·5 cites

Learning and Exploiting Multiple Subgoals for Fast Exploration in Hierarchical Reinforcement Learning

Libo Xing

PDF

Open Access

TL;DR

This paper introduces a multi-goal hierarchical reinforcement learning algorithm that uses a high-level manager to set multiple subgoals for the low-level worker, significantly improving exploration efficiency in sparse reward environments like Montezuma's Revenge.

Contribution

The paper proposes a novel multi-goal HRL framework with a high-level policy providing multiple subgoals, enabling faster exploration without manual environment-specific design.

Findings

01

Achieves state-of-the-art performance in Montezuma's Revenge

02

Reduces training time compared to existing HRL methods

03

Effectively learns to respond to multiple subgoals from high-level policies

Abstract

Hierarchical Reinforcement Learning (HRL) exploits temporally extended actions, or options, to make decisions from a higher-dimensional perspective to alleviate the sparse reward problem, one of the most challenging problems in reinforcement learning. The majority of existing HRL algorithms require either significant manual design with respect to the specific environment or enormous exploration to automatically learn options from data. To achieve fast exploration without using manual design, we devise a multi-goal HRL algorithm, consisting of a high-level policy Manager and a low-level policy Worker. The Manager provides the Worker multiple subgoals at each time step. Each subgoal corresponds to an option to control the environment. Although the agent may show some confusion at the beginning of training since it is guided by three diverse subgoals, the agent's behavior policy will…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Software Engineering Research