Hierarchical Reinforcement Learning for Deep Goal Reasoning: An   Expressiveness Analysis

Weihang Yuan; H\'ector Mu\~noz-Avila

arXiv:2006.11704·cs.AI·June 23, 2020·1 cites

Hierarchical Reinforcement Learning for Deep Goal Reasoning: An Expressiveness Analysis

Weihang Yuan, H\'ector Mu\~noz-Avila

PDF

Open Access

TL;DR

This paper analyzes the expressiveness of hierarchical reinforcement learning architectures, demonstrating that recurrent hierarchical frameworks are more expressive than feedforward ones, supported by theoretical analysis and experimental validation.

Contribution

It introduces the recurrent hierarchical framework (RHF), generalizing existing architectures, and provides an expressiveness comparison with hierarchical DQN using formal grammar analysis.

Findings

01

RHF is more expressive than HF.

02

Experimental results support theoretical analysis.

03

Certain tasks cannot be solved by HF architectures.

Abstract

Hierarchical DQN (h-DQN) is a two-level architecture of feedforward neural networks where the meta level selects goals and the lower level takes actions to achieve the goals. We show tasks that cannot be solved by h-DQN, exemplifying the limitation of this type of hierarchical framework (HF). We describe the recurrent hierarchical framework (RHF), generalizing architectures that use a recurrent neural network at the meta level. We analyze the expressiveness of HF and RHF using context-sensitive grammars. We show that RHF is more expressive than HF. We perform experiments comparing an implementation of RHF with two HF baselines; the results corroborate our theoretical findings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Evolutionary Algorithms and Applications