Cautious Calibration in Binary Classification
Mari-Liis Allikivi, Joonas J\"arve, Meelis Kull

TL;DR
This paper introduces cautious calibration in binary classification, focusing on producing underconfident probability estimates to improve trustworthiness in high-risk decision-making scenarios, supported by a theoretically grounded method and experimental validation.
Contribution
It proposes the novel concept of cautious calibration, emphasizing underconfidence in probability estimates and providing a theoretically grounded learning method for this purpose.
Findings
Our method consistently produces cautious, underconfident estimates.
It outperforms existing approaches in high-risk scenarios.
Establishes a baseline for future cautious calibration research.
Abstract
Being cautious is crucial for enhancing the trustworthiness of machine learning systems integrated into decision-making pipelines. Although calibrated probabilities help in optimal decision-making, perfect calibration remains unattainable, leading to estimates that fluctuate between under- and overconfidence. This becomes a critical issue in high-risk scenarios, where even occasional overestimation can lead to extreme expected costs. In these scenarios, it is important for each predicted probability to lean towards underconfidence, rather than just achieving an average balance. In this study, we introduce the novel concept of cautious calibration in binary classification. This approach aims to produce probability estimates that are intentionally underconfident for each predicted probability. We highlight the importance of this approach in a high-risk scenario and propose a theoretically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification
