On Credit Assignment in Hierarchical Reinforcement Learning
Joery A. de Vries, Thomas M. Moerland, Aske Plaat

TL;DR
This paper investigates hierarchical credit assignment in reinforcement learning, revealing how multistep backups can be adapted for hierarchy, and introduces a new algorithm HierQ_k(λ) that improves agent performance.
Contribution
It provides a fundamental understanding of hierarchical backups and proposes HierQ_k(λ), a novel algorithm that enhances reinforcement learning through hierarchical credit assignment.
Findings
Hierarchical backups can be viewed as multistep backups with skip connections.
Generalizing to multistep return estimation requires environment trace partitioning.
HierQ_k(λ) demonstrates performance improvements due to hierarchical credit assignment.
Abstract
Hierarchical Reinforcement Learning (HRL) has held longstanding promise to advance reinforcement learning. Yet, it has remained a considerable challenge to develop practical algorithms that exhibit some of these promises. To improve our fundamental understanding of HRL, we investigate hierarchical credit assignment from the perspective of conventional multistep reinforcement learning. We show how e.g., a 1-step `hierarchical backup' can be seen as a conventional multistep backup with skip connections over time connecting each subsequent state to the first independent of actions inbetween. Furthermore, we find that generalizing hierarchy to multistep return estimation methods requires us to consider how to partition the environment trace, in order to construct backup paths. We leverage these insight to develop a new hierarchical algorithm Hier, for which we demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications
