TL;DR
This paper compares two deep learning methods, Deep Ensembles and Deep Evidential Regression, for estimating aleatoric uncertainty across different data types and noise levels, highlighting calibration issues and the need for further refinement.
Contribution
It systematically evaluates and compares the performance and calibration of DE and DER in estimating aleatoric uncertainty for 0D and 2D data with varying noise levels.
Findings
Uncertainty scales with injected noise level.
Predicted uncertainty is often miscalibrated compared to true uncertainty.
Calibration issues are more pronounced in high-noise and high-dimensional scenarios.
Abstract
Assessing the quality of aleatoric uncertainty estimates from uncertainty quantification (UQ) deep learning methods is important in scientific contexts, where uncertainty is physically meaningful and important to characterize and interpret exactly. We systematically compare aleatoric uncertainty measured by two UQ techniques, Deep Ensembles (DE) and Deep Evidential Regression (DER). Our method focuses on both zero-dimensional (0D) and two-dimensional (2D) data, to explore how the UQ methods function for different data dimensionalities. We investigate uncertainty injected on the input and output variables and include a method to propagate uncertainty in the case of input uncertainty so that we can compare the predicted aleatoric uncertainty to the known values. We experiment with three levels of noise. The aleatoric uncertainty predicted across all models and experiments scales with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDeep Ensembles
