Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning
L\'eopold Mayti\'e, Roland Bertin Johannet, Rufin VanRullen

TL;DR
This paper explores integrating Global Workspace Theory with world models in reinforcement learning, demonstrating improved training efficiency and robustness through multimodal latent space dreaming.
Contribution
It introduces a novel RL system combining GW with world models, showing benefits in training efficiency and modality robustness compared to existing methods.
Findings
Fewer environment steps needed for training.
Enhanced robustness to missing observation modalities.
Emergent multimodal integration capabilities.
Abstract
Humans leverage rich internal models of the world to reason about the future, imagine counterfactuals, and adapt flexibly to new situations. In Reinforcement Learning (RL), world models aim to capture how the environment evolves in response to the agent's actions, facilitating planning and generalization. However, typical world models directly operate on the environment variables (e.g. pixels, physical attributes), which can make their training slow and cumbersome; instead, it may be advantageous to rely on high-level latent dimensions that capture relevant multimodal variables. Global Workspace (GW) Theory offers a cognitive framework for multimodal integration and information broadcasting in the brain, and recent studies have begun to introduce efficient deep learning implementations of GW. Here, we evaluate the capabilities of an RL system combining GW with a world model. We compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbodied and Extended Cognition · Reinforcement Learning in Robotics · Action Observation and Synchronization
MethodsEntropy Regularization · Proximal Policy Optimization
