Consistent and Asymptotically Unbiased Estimation of Proper Calibration   Errors

Teodora Popordanoska; Sebastian G. Gruber; Aleksei Tiulpin; Florian; Buettner; Matthew B. Blaschko

arXiv:2312.08589·cs.LG·December 15, 2023·1 cites

Consistent and Asymptotically Unbiased Estimation of Proper Calibration Errors

Teodora Popordanoska, Sebastian G. Gruber, Aleksei Tiulpin, Florian, Buettner, Matthew B. Blaschko

PDF

Open Access

TL;DR

This paper introduces a method for consistent and asymptotically unbiased estimation of proper calibration errors and refinement in probabilistic models, addressing a key gap in the evaluation of model calibration.

Contribution

It proposes a novel estimator for calibration errors and refinement that has proven statistical properties, including consistency and asymptotic unbiasedness, and introduces the Kullback--Leibler calibration error.

Findings

01

The estimator is consistent and asymptotically unbiased.

02

The relation between refinement and f-divergences is established.

03

Empirical validation confirms the estimator's properties and guides calibration method choice.

Abstract

Proper scoring rules evaluate the quality of probabilistic predictions, playing an essential role in the pursuit of accurate and well-calibrated models. Every proper score decomposes into two fundamental components -- proper calibration error and refinement -- utilizing a Bregman divergence. While uncertainty calibration has gained significant attention, current literature lacks a general estimator for these quantities with known statistical properties. To address this gap, we propose a method that allows consistent, and asymptotically unbiased estimation of all proper calibration errors and refinement terms. In particular, we introduce Kullback--Leibler calibration error, induced by the commonly used cross-entropy loss. As part of our results, we prove the relation between refinement and f-divergences, which implies information monotonicity in neural networks, regardless of which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Statistical Methods and Models