On Calibrated Model Uncertainty in Deep Learning
Biraja Ghoshal, Allan Tucker

TL;DR
This paper enhances Bayesian neural networks by calibrating uncertainty estimates to improve decision-making in high-stakes tasks like Covid-19 diagnosis, without sacrificing accuracy.
Contribution
It introduces a loss-calibrated Bayesian inference method for dropweights neural networks and proposes MUCE for measuring calibration quality.
Findings
Reduces miscalibration significantly in models
Improves diagnostic reliability without accuracy loss
Enhances detection of Covid-19 from X-Ray images
Abstract
Estimated uncertainty by approximate posteriors in Bayesian neural networks are prone to miscalibration, which leads to overconfident predictions in critical tasks that have a clear asymmetric cost or significant losses. Here, we extend the approximate inference for the loss-calibrated Bayesian framework to dropweights based Bayesian neural networks by maximising expected utility over a model posterior to calibrate uncertainty in deep learning. Furthermore, we show that decisions informed by loss-calibrated uncertainty can improve diagnostic performance to a greater extent than straightforward alternatives. We propose Maximum Uncertainty Calibration Error (MUCE) as a metric to measure calibrated confidence, in addition to its prediction especially for high-risk applications, where the goal is to minimise the worst-case deviation between error and estimated uncertainty. In experiments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
