On Deep Neural Network Calibration by Regularization and its Impact on   Refinement

Aditya Singh; Alessandro Bay; Biswa Sengupta; Andrea Mirabile

arXiv:2106.09385·cs.LG·May 5, 2022

On Deep Neural Network Calibration by Regularization and its Impact on Refinement

Aditya Singh, Alessandro Bay, Biswa Sengupta, Andrea Mirabile

PDF

Open Access

TL;DR

This paper investigates how regularization-based calibration methods affect the refinement of deep neural networks, revealing that many such methods degrade the network's ability to distinguish correct from incorrect predictions, especially under data shift.

Contribution

It provides a theoretical and empirical analysis linking calibration and refinement, highlighting the trade-offs of current calibration techniques on model separability.

Findings

01

Calibration methods like label smoothing and mixup reduce refinement.

02

Refinement degradation persists under natural data shift.

03

Calibration-refinement trade-off is a common phenomenon across methods.

Abstract

Deep neural networks have been shown to be highly miscalibrated. often they tend to be overconfident in their predictions. It poses a significant challenge for safety-critical systems to utilise deep neural networks (DNNs), reliably. Many recently proposed approaches to mitigate this have demonstrated substantial progress in improving DNN calibration. However, they hardly touch upon refinement, which historically has been an essential aspect of calibration. Refinement indicates separability of a network's correct and incorrect predictions. This paper presents a theoretically and empirically supported exposition reviewing refinement of a calibrated model. Firstly, we show the breakdown of expected calibration error (ECE), into predicted confidence and refinement under the assumption of over-confident predictions. Secondly, linking with this result, we highlight that regularization based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification

MethodsMixup