On Credit Assignment in Hierarchical Reinforcement Learning

Joery A. de Vries; Thomas M. Moerland; Aske Plaat

arXiv:2203.03292·cs.LG·March 8, 2022

On Credit Assignment in Hierarchical Reinforcement Learning

Joery A. de Vries, Thomas M. Moerland, Aske Plaat

PDF

Open Access 1 Repo

TL;DR

This paper investigates hierarchical credit assignment in reinforcement learning, revealing how multistep backups can be adapted for hierarchy, and introduces a new algorithm HierQ_k(λ) that improves agent performance.

Contribution

It provides a fundamental understanding of hierarchical backups and proposes HierQ_k(λ), a novel algorithm that enhances reinforcement learning through hierarchical credit assignment.

Findings

01

Hierarchical backups can be viewed as multistep backups with skip connections.

02

Generalizing to multistep return estimation requires environment trace partitioning.

03

HierQ_k(λ) demonstrates performance improvements due to hierarchical credit assignment.

Abstract

Hierarchical Reinforcement Learning (HRL) has held longstanding promise to advance reinforcement learning. Yet, it has remained a considerable challenge to develop practical algorithms that exhibit some of these promises. To improve our fundamental understanding of HRL, we investigate hierarchical credit assignment from the perspective of conventional multistep reinforcement learning. We show how e.g., a 1-step `hierarchical backup' can be seen as a conventional multistep backup with $n$ skip connections over time connecting each subsequent state to the first independent of actions inbetween. Furthermore, we find that generalizing hierarchy to multistep return estimation methods requires us to consider how to partition the environment trace, in order to construct backup paths. We leverage these insight to develop a new hierarchical algorithm Hier $Q_{k} (λ)$ , for which we demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

joeryjoery/hierq
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications