On Deep Neural Network Calibration by Regularization and its Impact on Refinement
Aditya Singh, Alessandro Bay, Biswa Sengupta, Andrea Mirabile

TL;DR
This paper investigates how regularization-based calibration methods affect the refinement of deep neural networks, revealing that many such methods degrade the network's ability to distinguish correct from incorrect predictions, especially under data shift.
Contribution
It provides a theoretical and empirical analysis linking calibration and refinement, highlighting the trade-offs of current calibration techniques on model separability.
Findings
Calibration methods like label smoothing and mixup reduce refinement.
Refinement degradation persists under natural data shift.
Calibration-refinement trade-off is a common phenomenon across methods.
Abstract
Deep neural networks have been shown to be highly miscalibrated. often they tend to be overconfident in their predictions. It poses a significant challenge for safety-critical systems to utilise deep neural networks (DNNs), reliably. Many recently proposed approaches to mitigate this have demonstrated substantial progress in improving DNN calibration. However, they hardly touch upon refinement, which historically has been an essential aspect of calibration. Refinement indicates separability of a network's correct and incorrect predictions. This paper presents a theoretically and empirically supported exposition reviewing refinement of a calibrated model. Firstly, we show the breakdown of expected calibration error (ECE), into predicted confidence and refinement under the assumption of over-confident predictions. Secondly, linking with this result, we highlight that regularization based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
MethodsMixup
