Improving model calibration with accuracy versus uncertainty optimization
Ranganath Krishnan, Omesh Tickoo

TL;DR
This paper introduces a novel optimization approach using an accuracy versus uncertainty loss to improve the calibration of uncertainty estimates in deep neural networks, especially under distributional shifts.
Contribution
It proposes the AvUC loss function for joint accuracy and uncertainty calibration, applicable during training and post-hoc, demonstrating superior calibration on large-scale image classification tasks.
Findings
Better calibration than existing methods under distributional shift
Effective both during training and post-hoc calibration
Improved uncertainty quantification in safety-critical applications
Abstract
Obtaining reliable and accurate quantification of uncertainty estimates from deep neural networks is important in safety-critical applications. A well-calibrated model should be accurate when it is certain about its prediction and indicate high uncertainty when it is likely to be inaccurate. Uncertainty calibration is a challenging problem as there is no ground truth available for uncertainty estimates. We propose an optimization method that leverages the relationship between accuracy and uncertainty as an anchor for uncertainty calibration. We introduce a differentiable accuracy versus uncertainty calibration (AvUC) loss function that allows a model to learn to provide well-calibrated uncertainties, in addition to improved accuracy. We also demonstrate the same methodology can be extended to post-hoc uncertainty calibration on pretrained models. We illustrate our approach with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
MethodsVariational Inference
