Unreliable Uncertainty Estimates with Monte Carlo Dropout
Aslak Djupsk{\aa}s, Alexander Johannes Stasik, Signe Riemer-S{\o}rensen

TL;DR
This paper critically evaluates Monte Carlo dropout for uncertainty estimation in deep learning, revealing its limitations in accurately capturing true uncertainty compared to Bayesian methods, especially in extrapolation regions.
Contribution
It provides an empirical comparison showing that Monte Carlo dropout often fails to reliably estimate uncertainty, highlighting its limitations relative to Bayesian neural networks and Gaussian processes.
Findings
MCD struggles to reflect true uncertainty accurately.
MCD fails to capture increased uncertainty in extrapolation regions.
Uncertainty estimates from MCD are less reliable than Bayesian approaches.
Abstract
Reliable uncertainty estimation is crucial for machine learning models, especially in safety-critical domains. While exact Bayesian inference offers a principled approach, it is often computationally infeasible for deep neural networks. Monte Carlo dropout (MCD) was proposed as an efficient approximation to Bayesian inference in deep learning by applying neuron dropout at inference time \citep{gal2016dropout}. Hence, the method generates multiple sub-models yielding a distribution of predictions to estimate uncertainty. We empirically investigate its ability to capture true uncertainty and compare to Gaussian Processes (GP) and Bayesian Neural Networks (BNN). We find that MCD struggles to accurately reflect the underlying true uncertainty, particularly failing to capture increased uncertainty in extrapolation and interpolation regions as observed in Bayesian models. The findings suggest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference
