Too Sharp, Too Sure: When Calibration Follows Curvature

Alessandro Morosini; Matea Gjika; Tomaso Poggio; Pierfrancesco Beneventano

arXiv:2604.20614·cs.LG·April 23, 2026

TL;DR

This paper investigates the relationship between calibration, curvature, and margins in neural network training, proposing a margin-aware objective to improve calibration without losing accuracy.

Contribution

The paper shows that calibration and curvature are coupled during training, and it introduces a margin-aware training objective to improve calibration.

Findings

01

ECE closely tracks curvature-based sharpness during training

02

Both ECE and Gauss-Newton curvature are controlled by margin-dependent exponential tails

03

Margin-aware training improves out-of-sample calibration without sacrificing accuracy
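
To make the notion of a margin-dependent exponential tail concrete, here is a minimal sketch. It assumes the standard multiclass margin (true-class logit minus the largest other logit) and uses the mean of exp(-margin) as an illustrative stand-in; the exact functional used in the paper is not stated on this page, and the function names `multiclass_margins` and `exponential_tail` are hypothetical.

```python
import numpy as np

def multiclass_margins(logits, labels):
    """Margin per example: true-class logit minus the largest competing logit."""
    logits = np.asarray(logits, dtype=float)
    labels = np.asarray(labels)
    rows = np.arange(logits.shape[0])
    true_logit = logits[rows, labels]
    # Mask out the true class so the max runs over competing classes only.
    others = logits.copy()
    others[rows, labels] = -np.inf
    return true_logit - others.max(axis=1)

def exponential_tail(margins):
    """Assumed illustrative functional: average of exp(-margin) over examples."""
    return float(np.mean(np.exp(-np.asarray(margins, dtype=float))))

logits = [[2.0, 0.0, 1.0], [0.0, 3.0, 1.0]]
labels = [0, 1]
margins = multiclass_margins(logits, labels)
tail = exponential_tail(margins)
```

Under this assumed form, the quantity shrinks as margins grow, which is the qualitative behavior that an exponential tail in the margin describes.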

Abstract

Modern neural networks can achieve high accuracy while remaining poorly calibrated, producing confidence estimates that do not match empirical correctness. Yet calibration is often treated as a post-hoc attribute. We take a different perspective: we study calibration as a training-time phenomenon on small vision tasks, and ask whether calibrated solutions can be obtained reliably by intervening on the training procedure. We identify a tight coupling between calibration, curvature, and margins during training of deep networks under multiple gradient-based methods. Empirically, Expected Calibration Error (ECE) closely tracks curvature-based sharpness throughout optimization. Mathematically, we show that both ECE and Gauss-Newton curvature are controlled, up to problem-specific constants, by the same margin-dependent exponential tail functional along the trajectory. Guided by this…
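
For readers unfamiliar with ECE, the following is a minimal sketch of the commonly used binned estimator: predictions are grouped by confidence into equal-width bins, and the gap between average accuracy and average confidence in each bin is averaged with weights proportional to bin size. The bin count, the binning scheme, and the function name `expected_calibration_error` are assumptions for illustration; the paper's own evaluation setup is not described on this page.

```python
import numpy as np

def expected_calibration_error(probs, labels, n_bins=10):
    """Binned ECE with equal-width confidence bins (lo, hi]."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    confidences = probs.max(axis=1)       # top-class probability
    predictions = probs.argmax(axis=1)    # predicted class
    correct = (predictions == labels).astype(float)

    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Interior edges map each confidence to a bin index in 0..n_bins-1.
    bin_ids = np.digitize(confidences, edges[1:-1], right=True)

    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap      # weight by fraction of samples in bin
    return ece

probs = [[0.85, 0.15], [0.85, 0.15], [0.65, 0.35], [0.65, 0.35]]
labels = [0, 0, 0, 1]
ece = expected_calibration_error(probs, labels)
```

In this toy input, the high-confidence bin is underconfident (all correct at confidence 0.85) and the lower-confidence bin is overconfident (half correct at confidence 0.65), so both contribute to the error.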
