Adaptive World Models: Learning Behaviors by Latent Imagination Under Non-Stationarity

Emiliyan Gospodinov; Vaisakh Shaj; Philipp Becker; Stefan Geyer; Gerhard Neumann

arXiv:2411.01342·cs.LG·November 5, 2024

Open Access

TL;DR

This paper introduces the Hidden Parameter-POMDP, a formalism for control with adaptive world models, and shows that it yields robust behaviors on non-stationary RL benchmarks while learning task-aware latent spaces without supervision.

Contribution

The paper presents a new formalism, the Hidden Parameter-POMDP, for control in non-stationary settings. The formalism enables unsupervised learning of task abstractions and yields structured latent spaces.

Findings

1. Learns robust behaviors across a variety of non-stationary RL benchmarks.

2. Learns task abstractions in an unsupervised manner.

3. Produces structured, task-aware latent spaces.

Abstract

Developing foundational world models is a key research direction for embodied intelligence, with the ability to adapt to non-stationary environments being a crucial criterion. In this work, we introduce a new formalism, Hidden Parameter-POMDP, designed for control with adaptive world models. We demonstrate that this approach enables learning robust behaviors across a variety of non-stationary RL benchmarks. Additionally, this formalism effectively learns task abstractions in an unsupervised manner, resulting in structured, task-aware latent spaces.

Taxonomy

Topics: Evolutionary Algorithms and Applications · Neural Networks and Applications