Computational and Statistical Hardness of Calibration Distance

Mingda Qiao

arXiv:2603.18391·cs.DS·March 20, 2026

TL;DR

This paper studies the complexity of computing and estimating the calibration distance, a central measure of miscalibration. It gives an efficient exact algorithm when the distribution has a uniform marginal and noiseless labels, and proves NP-hardness when either assumption is removed.

Contribution

It introduces an efficient exact algorithm for distributions with a uniform marginal and noiseless labels, extends it to a polynomial-time approximation scheme for the general case, proves NP-hardness once either assumption is dropped, and establishes a Θ(1/ε^3) sample complexity bound for estimation.

Findings

01

Efficient exact computation for distributions with a uniform marginal and noiseless labels.

02

NP-hardness when either of the two assumptions (uniform marginal, noiseless labels) is removed.

03

Sample complexity of Θ(1/ε^3) for estimation.

Abstract

The distance from calibration, introduced by Błasiok, Gopalan, Hu, and Nakkiran (STOC 2023), has recently emerged as a central measure of miscalibration for probabilistic predictors. We study the fundamental problems of computing and estimating this quantity, given either an exact description of the data distribution or only sample access to it. We give an efficient algorithm that exactly computes the calibration distance when the distribution has a uniform marginal and noiseless labels, which improves the $O(1/|X|)$ additive approximation of Qiao and Zheng (COLT 2024) for this special case. Perhaps surprisingly, the problem becomes NP-hard when either of the two assumptions is removed. We extend our algorithm to a polynomial-time approximation scheme for the general case. For the estimation problem, we show that $\Theta(1/\epsilon^{3})$ samples are…
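To make the quantity concrete, here is a minimal brute-force sketch for the uniform, noiseless special case on a tiny finite domain. It is not the paper's algorithm. It assumes the standard definition of distance from calibration (the smallest expected absolute difference between the given predictor and any perfectly calibrated predictor), a deterministic alternative predictor, and normalization by the domain size. Under those assumptions, every calibrated predictor corresponds to a partition of the domain in which each block predicts its own average label, so enumerating all set partitions suffices. The names `set_partitions` and `brute_force_calibration_distance` are hypothetical, and the running time is exponential in the domain size.

```python
from fractions import Fraction


def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # Put `first` into each existing block in turn...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ...or into a new block of its own.
        yield [[first]] + partition


def brute_force_calibration_distance(preds, labels):
    """Exhaustive search over calibrated predictors (illustrative only).

    Assumptions: uniform marginal over the n points, noiseless labels in
    {0, 1}, and distance measured as the average of |f(x) - g(x)|.
    """
    n = len(preds)
    if n == 0 or n != len(labels):
        raise ValueError("preds and labels must be non-empty and equal length")
    best = None
    for partition in set_partitions(list(range(n))):
        cost = Fraction(0)
        for block in partition:
            # A calibrated predictor must output the block's mean label.
            value = Fraction(sum(labels[i] for i in block), len(block))
            cost += sum(abs(Fraction(preds[i]) - value) for i in block)
        cost /= n
        if best is None or cost < best:
            best = cost
    return best


if __name__ == "__main__":
    half = Fraction(1, 2)
    # Predicting 1/2 on one negative and one positive point is calibrated.
    print(brute_force_calibration_distance([half, half], [0, 1]))
    # Predicting 1 on the same two points is not.
    print(brute_force_calibration_distance([1, 1], [0, 1]))
```

Exact rational arithmetic via `Fraction` avoids floating-point ties when comparing partitions. The number of partitions grows as the Bell numbers, so this is only usable as a sanity check on a handful of points.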

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Machine Learning and Algorithms