Truthfulness of Calibration Measures
Nika Haghtalab, Mingda Qiao, Kunhe Yang, Eric Zhao

TL;DR
This paper studies the truthfulness of calibration measures in sequential prediction, revealing that existing measures are not truthful and introducing a new measure, SSCE, that encourages truthful forecasting.
Contribution
It provides a taxonomy of calibration measures, demonstrates their lack of truthfulness, and proposes the SSCE as a truthful calibration measure.
Findings
Existing calibration measures are not truthful.
Simple distributions can exploit current measures for low penalties.
SSCE makes truthful prediction nearly optimal.
Abstract
We initiate the study of the truthfulness of calibration measures in sequential prediction. A calibration measure is said to be truthful if the forecaster (approximately) minimizes the expected penalty by predicting the conditional expectation of the next outcome, given the prior distribution of outcomes. Truthfulness is an important property of calibration measures, ensuring that the forecaster is not incentivized to exploit the system with deliberate poor forecasts. This makes it an essential desideratum for calibration measures, alongside typical requirements, such as soundness and completeness. We conduct a taxonomy of existing calibration measures and their truthfulness. Perhaps surprisingly, we find that all of them are far from being truthful. That is, under existing calibration measures, there are simple distributions on which a polylogarithmic (or even zero) penalty is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsScientific Measurement and Uncertainty Evaluation · Fault Detection and Control Systems
