Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Kiri L. Wagstaff; Thomas G. Dietterich

arXiv:2202.01840·cs.LG·February 22, 2023

Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Kiri L. Wagstaff, Thomas G. Dietterich

PDF

Open Access 1 Repo

TL;DR

This paper introduces a measure for hidden heterogeneity in classifiers and proposes similarity-weighted calibration methods that adapt locally to improve probability calibration, especially in subpopulations with unrecognized heterogeneity.

Contribution

It presents a quantitative measure for hidden heterogeneity and develops two local calibration methods that leverage similarity to enhance calibration accuracy.

Findings

01

Similarity-based calibration improves with more hidden heterogeneity.

02

Local calibration methods outperform global ones when HH is present.

03

HH serves as a diagnostic for local calibration effectiveness.

Abstract

Trustworthy classifiers are essential to the adoption of machine learning predictions in many real-world settings. The predicted probability of possible outcomes can inform high-stakes decision making, particularly when assessing the expected value of alternative decisions or the risk of bad outcomes. These decisions require well-calibrated probabilities, not just the correct prediction of the most likely class. Black-box classifier calibration methods can improve the reliability of a classifier's output without requiring retraining. However, these methods are unable to detect subpopulations where calibration could also improve prediction accuracy. Such subpopulations are said to exhibit "hidden heterogeneity" (HH), because the original classifier did not detect them. This paper proposes a quantitative measure for HH. It also introduces two similarity-weighted calibration methods that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wkiri/simcalib
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification