Exploring the limits of Hierarchical World Models in Reinforcement Learning
Robin Schiewer, Anand Subramoney, Laurenz Wiskott

TL;DR
This paper introduces a novel hierarchical world model framework for reinforcement learning that enables multi-level decision making with low-dimensional abstract actions, but faces challenges with model exploitation affecting performance.
Contribution
The work presents a new HMBRL framework with environment-agnostic temporal abstraction and a hierarchical world model, advancing the integration of hierarchical and model-based RL.
Findings
Facilitates decision making across two abstraction levels
Uses low-dimensional abstract actions for efficiency
Identifies model exploitation as a key challenge
Abstract
Hierarchical model-based reinforcement learning (HMBRL) aims to combine the benefits of better sample efficiency of model based reinforcement learning (MBRL) with the abstraction capability of hierarchical reinforcement learning (HRL) to solve complex tasks efficiently. While HMBRL has great potential, it still lacks wide adoption. In this work we describe a novel HMBRL framework and evaluate it thoroughly. To complement the multi-layered decision making idiom characteristic for HRL, we construct hierarchical world models that simulate environment dynamics at various levels of temporal abstraction. These models are used to train a stack of agents that communicate in a top-down manner by proposing goals to their subordinate agents. A significant focus of this study is the exploration of a static and environment agnostic temporal abstraction, which allows concurrent training of models and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications
