Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun, Hongyu Zang, Xin Li, Riashat Islam

TL;DR
This paper introduces a novel approach combining spatio-temporal masking, bisimulation, and latent reconstruction within a hybrid recurrent state-space model to improve visual model-based reinforcement learning in noisy environments.
Contribution
It proposes a new latent dynamic robust representation method and a hybrid recurrent state-space model to enhance world model robustness and performance in complex visual tasks.
Findings
Significant performance improvements in visually complex control tasks.
Effective filtering of irrelevant noise in environment observations.
Enhanced stability in joint training of representations, dynamics, and policy.
Abstract
Visual Model-Based Reinforcement Learning (MBRL) promises to encapsulate agent's knowledge about the underlying dynamics of the environment, enabling learning a world model as a useful planner. However, top MBRL agents such as Dreamer often struggle with visual pixel-based inputs in the presence of exogenous or irrelevant noise in the observation space, due to failure to capture task-specific features while filtering out irrelevant spatio-temporal details. To tackle this problem, we apply a spatio-temporal masking strategy, a bisimulation principle, combined with latent reconstruction, to capture endogenous task-specific aspects of the environment for world models, effectively eliminating non-essential information. Joint training of representations, dynamics, and policy often leads to instabilities. To further address this issue, we develop a Hybrid Recurrent State-Space Model (HRSSM)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Time Series Analysis and Forecasting · Machine Learning and Data Classification
