Not all uncertainty is alike: volatility, stochasticity, and exploration
Payam Piray

TL;DR
This paper distinguishes different sources of environmental uncertainty, showing that volatility promotes exploration while stochasticity suppresses it, and introduces a new exploration method called CAUSE that leverages this insight.
Contribution
It formally characterizes the opposite effects of volatility and stochasticity on exploration and develops CAUSE, a novel exploration bonus based on control-as-inference.
Findings
Volatility enhances exploration, stochasticity suppresses it.
CAUSE outperforms standard strategies in noisy environments.
Reversed exploration occurs with pathological noise inference.
Abstract
Adaptive decision-making in biological and artificial intelligence requires balancing the exploitation of known outcomes with the exploration of uncertain alternatives. Although prior work suggests that uncertainty generally promotes exploration, it has typically treated distinct sources of environmental uncertainty as equivalent. We consider environments with latent reward states that drift over time (volatility) and are observed through noisy outcomes (stochasticity). Both increase posterior uncertainty, yet we show they drive optimal exploration in opposite directions: volatility enhances it, stochasticity suppresses it. We establish this asymmetry formally by extending the Gittins index framework to Gaussian state-space bandits with latent dynamics. We further derive Cause-Aware Uncertainty-Sensitive Exploration (CAUSE), a closed-form exploration bonus obtained via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
