Decision-Making under Miscalibration
Guy N. Rothblum, Gal Yona

TL;DR
This paper addresses how to optimally set decision thresholds when using potentially miscalibrated machine learning predictions in clinical settings, proposing a distribution-free approach that minimizes worst-case regret.
Contribution
It introduces a novel, distribution-free method for choosing decision thresholds under miscalibration, with closed-form solutions and validation on real clinical data.
Findings
Using the proposed threshold can improve clinical utility in certain cases.
The optimal threshold under miscalibration differs from the perfect calibration case.
Theoretical results are validated with real-world data.
Abstract
ML-based predictions are used to inform consequential decisions about individuals. How should we use predictions (e.g., risk of heart attack) to inform downstream binary classification decisions (e.g., undergoing a medical procedure)? When the risk estimates are perfectly calibrated, the answer is well understood: a classification problem's cost structure induces an optimal treatment threshold . In practice, however, some amount of miscalibration is unavoidable, raising a fundamental question: how should one use potentially miscalibrated predictions to inform binary decisions? We formalize a natural (distribution-free) solution concept: given anticipated miscalibration of , we propose using the threshold that minimizes the worst-case regret over all -miscalibrated predictors, where the regret is the difference in clinical utility between using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Decision-Making Under Miscalibration· youtube
Taxonomy
TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Advanced Bandit Algorithms Research
