World Model Robustness via Surprise Recognition

Geigh Zollicoffer; Tanush Chopra; Mingkuan Yan; Xiaoxu Ma; Kenneth Eaton; Mark Riedl

arXiv:2512.01119·cs.LG·December 2, 2025

World Model Robustness via Surprise Recognition

Geigh Zollicoffer, Tanush Chopra, Mingkuan Yan, Xiaoxu Ma, Kenneth Eaton, Mark Riedl

PDF

Open Access

TL;DR

This paper presents a novel approach that uses surprise recognition in world models to improve the robustness of reinforcement learning agents against out-of-distribution noise and sensor faults, maintaining performance across diverse environments.

Contribution

The authors introduce surprise-based rejection sampling techniques for world models, enhancing robustness to sensor faults and noise in reinforcement learning agents.

Findings

01

Techniques preserve performance under various noise conditions.

02

Methods improve stability of different world model architectures.

03

Approach tested successfully in self-driving simulation environments.

Abstract

AI systems deployed in the real world must contend with distractions and out-of-distribution (OOD) noise that can destabilize their policies and lead to unsafe behavior. While robust training can reduce sensitivity to some forms of noise, it is infeasible to anticipate all possible OOD conditions. To mitigate this issue, we develop an algorithm that leverages a world model's inherent measure of surprise to reduce the impact of noise in world model--based reinforcement learning agents. We introduce both multi-representation and single-representation rejection sampling, enabling robustness to settings with multiple faulty sensors or a single faulty sensor. While the introduction of noise typically degrades agent performance, we show that our techniques preserve performance relative to baselines under varying types and levels of noise across multiple environments within self-driving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety