Biased Dreams: Limitations to Epistemic Uncertainty Quantification in Latent Space Models

Julia Berger; Bernd Frauenknecht; Sebastian Trimpe; Bastian Leibe

arXiv:2604.25416·cs.LG·April 29, 2026

Biased Dreams: Limitations to Epistemic Uncertainty Quantification in Latent Space Models

Julia Berger, Bernd Frauenknecht, Sebastian Trimpe, Bastian Leibe

PDF

TL;DR

This paper reveals that latent space models in reinforcement learning exhibit attractor biases that undermine accurate uncertainty estimation and reward prediction, challenging their reliability.

Contribution

It empirically demonstrates the bias in latent transitions and its impact on uncertainty quantification in latent dynamics models used in reinforcement learning.

Findings

01

Latent transitions are biased toward well-represented regions.

02

Biases can cause latent models to deviate from true environment dynamics.

03

Overestimation of rewards occurs in high-reward attractor regions.

Abstract

Model-Based Reinforcement Learning distinguishes between physical dynamics models operating on proprioceptive inputs and latent dynamics models operating on high-dimensional image observations. A prominent latent approach is the Recurrent State Space Model used in the Dreamer family. While epistemic uncertainty quantification to inform exploration and mitigate model exploitation is well established for physical dynamics models, its transfer to latent dynamics models has received limited scrutiny. We empirically demonstrate that latent transitions are biased toward well-represented regions of latent space, exhibiting an attractor behavior that can deviate from true environment dynamics. As a result, discrepancies in environment dynamics may not manifest in latent space, undermining the reliability of epistemic uncertainty estimates. Because these attractors often lie in high-reward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.