Hidden Heterogeneity: When to Choose Similarity-Based Calibration
Kiri L. Wagstaff, Thomas G. Dietterich

TL;DR
This paper introduces a measure for hidden heterogeneity in classifiers and proposes similarity-weighted calibration methods that adapt locally to improve probability calibration, especially in subpopulations with unrecognized heterogeneity.
Contribution
It presents a quantitative measure for hidden heterogeneity and develops two local calibration methods that leverage similarity to enhance calibration accuracy.
Findings
Similarity-based calibration improves with more hidden heterogeneity.
Local calibration methods outperform global ones when HH is present.
HH serves as a diagnostic for local calibration effectiveness.
Abstract
Trustworthy classifiers are essential to the adoption of machine learning predictions in many real-world settings. The predicted probability of possible outcomes can inform high-stakes decision making, particularly when assessing the expected value of alternative decisions or the risk of bad outcomes. These decisions require well-calibrated probabilities, not just the correct prediction of the most likely class. Black-box classifier calibration methods can improve the reliability of a classifier's output without requiring retraining. However, these methods are unable to detect subpopulations where calibration could also improve prediction accuracy. Such subpopulations are said to exhibit "hidden heterogeneity" (HH), because the original classifier did not detect them. This paper proposes a quantitative measure for HH. It also introduces two similarity-weighted calibration methods that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
