Too Sharp, Too Sure: When Calibration Follows Curvature
Alessandro Morosini, Matea Gjika, Tomaso Poggio, Pierfrancesco Beneventano

TL;DR
This paper investigates the relationship between calibration, curvature, and margins in neural network training, proposing a margin-aware objective to improve calibration without losing accuracy.
Contribution
It reveals the coupling between calibration and curvature during training and introduces a new margin-aware training method to enhance calibration.
Findings
ECE closely tracks curvature-based sharpness during training
Both ECE and Gauss--Newton curvature are controlled by margin-dependent exponential tails
Margin-aware training improves out-of-sample calibration without sacrificing accuracy
Abstract
Modern neural networks can achieve high accuracy while remaining poorly calibrated, producing confidence estimates that do not match empirical correctness. Yet calibration is often treated as a post-hoc attribute. We take a different perspective: we study calibration as a training-time phenomenon on small vision tasks, and ask whether calibrated solutions can be obtained reliably by intervening on the training procedure. We identify a tight coupling between calibration, curvature, and margins during training of deep networks under multiple gradient-based methods. Empirically, Expected Calibration Error (ECE) closely tracks curvature-based sharpness throughout optimization. Mathematically, we show that both ECE and Gauss--Newton curvature are controlled, up to problem-specific constants, by the same margin-dependent exponential tail functional along the trajectory. Guided by this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
