Computational and Statistical Hardness of Calibration Distance
Mingda Qiao

TL;DR
This paper investigates the computational complexity of calculating and estimating the calibration distance, a key measure of miscalibration, providing efficient algorithms under certain conditions and proving NP-hardness when assumptions are relaxed.
Contribution
It introduces an efficient exact algorithm for uniform, noiseless cases, extends it to a polynomial-time approximation scheme, and establishes sample complexity bounds for estimation, along with new hardness proofs.
Findings
Efficient exact computation for uniform, noiseless distributions.
NP-hardness when assumptions are relaxed.
Sample complexity of Θ(1/ε^3) for estimation.
Abstract
The distance from calibration, introduced by B{\l}asiok, Gopalan, Hu, and Nakkiran (STOC 2023), has recently emerged as a central measure of miscalibration for probabilistic predictors. We study the fundamental problems of computing and estimating this quantity, given either an exact description of the data distribution or only sample access to it. We give an efficient algorithm that exactly computes the calibration distance when the distribution has a uniform marginal and noiseless labels, which improves the additive approximation of Qiao and Zheng (COLT 2024) for this special case. Perhaps surprisingly, the problem becomes -hard when either of the two assumptions is removed. We extend our algorithm to a polynomial-time approximation scheme for the general case. For the estimation problem, we show that samples are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Machine Learning and Algorithms
