Recalibrating binary probabilistic classifiers
Dirk Tasche

TL;DR
This paper introduces new recalibration methods for binary classifiers based on distribution shift assumptions, demonstrating their effectiveness in credit risk evaluation scenarios.
Contribution
The paper proposes two novel recalibration techniques, CSPD and QMM, grounded in distribution shift theory, enhancing classifier calibration under prior probability constraints.
Findings
QMM provides conservative calibration results
Methods perform well under distribution shift assumptions
Improved calibration for credit risk applications
Abstract
Recalibration of binary probabilistic classifiers to a target prior probability is an important task in areas like credit risk management. However, recalibration of a classifier learned on a training dataset to a target on a test dataset in general is not a well-defined problem because there might be more than one way to transform the original posterior probabilities such that the target is matched. In this paper, methods for recalibration are analysed from a distribution shift perspective. Distribution shift assumptions linked to the area under the curve (AUC) of a probabilistic classifier are found to be useful for the design of meaningful recalibration methods. Two new methods called parametric covariate shift with posterior drift (CSPD) and ROC-based quasi moment matching (QMM) are proposed and tested together with some other methods in an example setting. The outcomes of the test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
