Exploring the limits of Hierarchical World Models in Reinforcement   Learning

Robin Schiewer; Anand Subramoney; Laurenz Wiskott

arXiv:2406.00483·cs.LG·June 4, 2024

Exploring the limits of Hierarchical World Models in Reinforcement Learning

Robin Schiewer, Anand Subramoney, Laurenz Wiskott

PDF

Open Access

TL;DR

This paper introduces a novel hierarchical world model framework for reinforcement learning that enables multi-level decision making with low-dimensional abstract actions, but faces challenges with model exploitation affecting performance.

Contribution

The work presents a new HMBRL framework with environment-agnostic temporal abstraction and a hierarchical world model, advancing the integration of hierarchical and model-based RL.

Findings

01

Facilitates decision making across two abstraction levels

02

Uses low-dimensional abstract actions for efficiency

03

Identifies model exploitation as a key challenge

Abstract

Hierarchical model-based reinforcement learning (HMBRL) aims to combine the benefits of better sample efficiency of model based reinforcement learning (MBRL) with the abstraction capability of hierarchical reinforcement learning (HRL) to solve complex tasks efficiently. While HMBRL has great potential, it still lacks wide adoption. In this work we describe a novel HMBRL framework and evaluate it thoroughly. To complement the multi-layered decision making idiom characteristic for HRL, we construct hierarchical world models that simulate environment dynamics at various levels of temporal abstraction. These models are used to train a stack of agents that communicate in a top-down manner by proposing goals to their subordinate agents. A significant focus of this study is the exploration of a static and environment agnostic temporal abstraction, which allows concurrent training of models and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications