Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning

Léopold Maytié; Roland Bertin Johannet; Rufin VanRullen

arXiv:2502.21142 · cs.AI · October 29, 2025

Open Access

TL;DR

This paper explores integrating Global Workspace Theory with world models in reinforcement learning. By "dreaming" in a multimodal latent space, the resulting system trains more efficiently and is more robust to missing observation modalities.

Contribution

The paper introduces an RL system that combines a Global Workspace (GW) with a world model, and shows benefits in training efficiency and robustness to missing modalities compared to existing methods.

Findings

01. Fewer environment steps needed for training.

02. Enhanced robustness to missing observation modalities.

03. Emergent multimodal integration capabilities.

Abstract

Humans leverage rich internal models of the world to reason about the future, imagine counterfactuals, and adapt flexibly to new situations. In Reinforcement Learning (RL), world models aim to capture how the environment evolves in response to the agent's actions, facilitating planning and generalization. However, typical world models directly operate on the environment variables (e.g. pixels, physical attributes), which can make their training slow and cumbersome; instead, it may be advantageous to rely on high-level latent dimensions that capture relevant multimodal variables. Global Workspace (GW) Theory offers a cognitive framework for multimodal integration and information broadcasting in the brain, and recent studies have begun to introduce efficient deep learning implementations of GW. Here, we evaluate the capabilities of an RL system combining GW with a world model. We compare…

Taxonomy

Topics: Embodied and Extended Cognition · Reinforcement Learning in Robotics · Action Observation and Synchronization

Methods: Entropy Regularization · Proximal Policy Optimization