Consistent and Asymptotically Unbiased Estimation of Proper Calibration Errors
Teodora Popordanoska, Sebastian G. Gruber, Aleksei Tiulpin, Florian, Buettner, Matthew B. Blaschko

TL;DR
This paper introduces a method for consistent and asymptotically unbiased estimation of proper calibration errors and refinement in probabilistic models, addressing a key gap in the evaluation of model calibration.
Contribution
It proposes a novel estimator for calibration errors and refinement that has proven statistical properties, including consistency and asymptotic unbiasedness, and introduces the Kullback--Leibler calibration error.
Findings
The estimator is consistent and asymptotically unbiased.
The relation between refinement and f-divergences is established.
Empirical validation confirms the estimator's properties and guides calibration method choice.
Abstract
Proper scoring rules evaluate the quality of probabilistic predictions, playing an essential role in the pursuit of accurate and well-calibrated models. Every proper score decomposes into two fundamental components -- proper calibration error and refinement -- utilizing a Bregman divergence. While uncertainty calibration has gained significant attention, current literature lacks a general estimator for these quantities with known statistical properties. To address this gap, we propose a method that allows consistent, and asymptotically unbiased estimation of all proper calibration errors and refinement terms. In particular, we introduce Kullback--Leibler calibration error, induced by the commonly used cross-entropy loss. As part of our results, we prove the relation between refinement and f-divergences, which implies information monotonicity in neural networks, regardless of which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Statistical Methods and Models
