Extending confidence calibration to generalised measures of variation

Andrew Thompson; Vivek Desai

arXiv:2602.12975·cs.LG·February 16, 2026

Extending confidence calibration to generalised measures of variation

Andrew Thompson, Vivek Desai

PDF

Open Access

TL;DR

This paper introduces the Variation Calibration Error (VCE), a new metric for assessing the calibration of machine learning classifiers across various measures of probability variation, extending beyond confidence calibration.

Contribution

The paper proposes the VCE metric, extending calibration assessment to any variation measure, and demonstrates its desirable properties through synthetic examples.

Findings

01

VCE approaches zero for perfectly calibrated predictions as data increases

02

VCE outperforms existing entropy-based metrics like UCE in calibration assessment

03

Numerical examples validate the effectiveness of VCE in different scenarios

Abstract

We propose the Variation Calibration Error (VCE) metric for assessing the calibration of machine learning classifiers. The metric can be viewed as an extension of the well-known Expected Calibration Error (ECE) which assesses the calibration of the maximum probability or confidence. Other ways of measuring the variation of a probability distribution exist which have the advantage of taking into account the full probability distribution, for example the Shannon entropy. We show how the ECE approach can be extended from assessing confidence calibration to assessing the calibration of any metric of variation. We present numerical examples upon synthetic predictions which are perfectly calibrated by design, demonstrating that, in this scenario, the VCE has the desired property of approaching zero as the number of data samples increases, in contrast to another entropy-based calibration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques · Machine Learning and Data Classification