Estimating calibration error under label shift without labels

Teodora Popordanoska; Gorjan Radevski; Tinne Tuytelaars; Matthew B. Blaschko

arXiv:2312.08586 · cs.LG · December 15, 2023 · 1 citation

Open Access

TL;DR

This paper introduces a method for estimating the calibration error of machine learning models under label shift without requiring target-domain labels, so that calibration can be assessed after deployment.

Contribution

The paper proposes an estimator of calibration error under label shift, based on importance re-weighting, that is consistent and asymptotically unbiased and requires no target labels.
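
The re-weighting idea rests on a standard identity for label shift, sketched here for orientation. The symbols $p_s$, $p_t$, $w$ and $g$ are notation assumed for this note and are not taken from the paper. Since $p_t(x \mid y) = p_s(x \mid y)$, and assuming $p_s(y) > 0$ wherever $p_t(y) > 0$:

$$
w(y) = \frac{p_t(y)}{p_s(y)}, \qquad
p_t(x, y) = w(y)\, p_s(x, y), \qquad
\mathbb{E}_{p_t}\big[g(X, Y)\big] = \mathbb{E}_{p_s}\big[w(Y)\, g(X, Y)\big].
$$

Any target-domain expectation of a function of $(X, Y)$ can therefore be rewritten as a weighted expectation over labeled source data, once the class weights $w(y)$ are known or estimated.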

Findings

01. Effective across diverse real-world datasets

02. Reliable under various label-shift conditions

03. Outperforms existing calibration estimators

Abstract

In the face of dataset shift, model calibration plays a pivotal role in ensuring the reliability of machine learning systems. Calibration error (CE) is an indicator of the alignment between the predicted probabilities and the classifier accuracy. While prior works have examined the implications of dataset shift for calibration, existing CE estimators assume access to labels from the target domain, which are often unavailable in practice, i.e., when the model is deployed and used. This work addresses this challenging scenario and proposes a novel CE estimator under label shift, which is characterized by changes in the marginal label distribution $p(Y)$ while the conditional $p(X \mid Y)$ stays constant between the source and target distributions. Our contribution is an approach which, by leveraging importance re-weighting of the labeled source distribution, provides consistent and…
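
To make the re-weighting idea concrete, the following is a minimal sketch of an importance-weighted calibration error computed on labeled source data. It is not the authors' estimator: it uses the familiar binned top-label ECE as a stand-in, and it assumes the class weights $w(y) = p_t(y) / p_s(y)$ are already given, whereas in practice they would have to be estimated from unlabeled target data. The function name `weighted_binned_ece` and its parameters are hypothetical.

```python
import numpy as np


def weighted_binned_ece(probs, labels, class_weights, n_bins=10):
    """Importance-weighted, binned, top-label ECE on labeled source data.

    probs:         (n, k) predicted class probabilities on source samples
    labels:        (n,)   integer source labels in [0, k)
    class_weights: (k,)   assumed-known weights w(y) = p_t(y) / p_s(y)
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    # Each sample inherits the weight of its true class.
    w = np.asarray(class_weights, dtype=float)[labels]

    conf = probs.max(axis=1)                      # top-label confidence
    correct = (probs.argmax(axis=1) == labels).astype(float)

    # Equal-width bins on [0, 1]; digitizing against the interior edges
    # yields bin indices in 0 .. n_bins - 1.
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    bin_idx = np.digitize(conf, edges[1:-1])

    total = w.sum()
    ece = 0.0
    for b in range(n_bins):
        mask = bin_idx == b
        wb = w[mask].sum()
        if wb == 0:
            continue  # empty bin contributes nothing
        acc = np.sum(w[mask] * correct[mask]) / wb    # weighted accuracy
        avg_conf = np.sum(w[mask] * conf[mask]) / wb  # weighted confidence
        ece += (wb / total) * abs(acc - avg_conf)
    return ece


if __name__ == "__main__":
    probs = [[0.95, 0.05], [0.85, 0.15], [0.25, 0.75], [0.35, 0.65]]
    labels = [0, 1, 1, 1]
    print(weighted_binned_ece(probs, labels, class_weights=[1.0, 1.0]))
    print(weighted_binned_ece(probs, labels, class_weights=[2.0, 0.5]))
```

With all weights equal to one the function reduces to the ordinary binned ECE on the source set. Non-uniform weights shift the estimate toward the classes that are more frequent in the target domain.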

Taxonomy

Topics: Machine Learning and Data Classification · Water Systems and Optimization · Hydrological Forecasting Using AI