Threshold Choice Methods: the Missing Link
Jos\'e Hern\'andez-Orallo, Peter Flach, C\`esar Ferri

TL;DR
This paper examines how different threshold choice methods impact classification performance metrics across various operating conditions, highlighting the importance of calibration and providing a systematic approach to minimize expected loss.
Contribution
It introduces a comprehensive analysis of threshold choice methods and their influence on performance metrics, connecting them through a unified framework and emphasizing calibration's role.
Findings
Different threshold methods correspond to specific performance metrics.
Calibration significantly affects threshold choice effectiveness.
Systematic comparison of threshold methods improves loss minimization.
Abstract
Many performance metrics have been introduced for the evaluation of classification performance, with different origins and niches of application: accuracy, macro-accuracy, area under the ROC curve, the ROC convex hull, the absolute error, and the Brier score (with its decomposition into refinement and calibration). One way of understanding the relation among some of these metrics is the use of variable operating conditions (either in the form of misclassification costs or class proportions). Thus, a metric may correspond to some expected loss over a range of operating conditions. One dimension for the analysis has been precisely the distribution we take for this range of operating conditions, leading to some important connections in the area of proper scoring rules. However, we show that there is another dimension which has not received attention in the analysis of performance metrics.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Statistical Methods in Clinical Trials · Data Mining Algorithms and Applications
