The Calibration Generalization Gap
A. Michael Carrell, Neil Mallinar, James Lucas, Preetum Nakkiran

TL;DR
This paper introduces a systematic framework to analyze calibration in neural networks by decomposing calibration error into train set calibration and the generalization gap, revealing that reducing the generalization gap improves calibration.
Contribution
It proposes a decomposition of calibration error into train calibration and generalization gap, providing empirical evidence linking small generalization gaps to better calibration in neural networks.
Findings
Neural networks are typically well-calibrated on their training set.
The calibration generalization gap is upper-bounded by the standard generalization gap.
Reducing the generalization gap improves model calibration.
Abstract
Calibration is a fundamental property of a good predictive model: it requires that the model predicts correctly in proportion to its confidence. Modern neural networks, however, provide no strong guarantees on their calibration -- and can be either poorly calibrated or well-calibrated depending on the setting. It is currently unclear which factors contribute to good calibration (architecture, data augmentation, overparameterization, etc), though various claims exist in the literature. We propose a systematic way to study the calibration error: by decomposing it into (1) calibration error on the train set, and (2) the calibration generalization gap. This mirrors the fundamental decomposition of generalization. We then investigate each of these terms, and give empirical evidence that (1) DNNs are typically always calibrated on their train set, and (2) the calibration generalization gap…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications · Advanced Neural Network Applications
