Optimally Efficient Sequential Calibration of Binary Classifiers to Minimize Classification Error
Kaan Gokcesu, Hakan Gokcesu

TL;DR
This paper introduces a sequential method for calibrating binary classifiers that finds an optimal hard mapping to minimize classification error, with proven efficiency and applicability to weighted and linear loss functions.
Contribution
It demonstrates that the optimal calibration mapping is a hard threshold and proposes a logarithmic time complexity recursive algorithm to find it.
Findings
The optimal calibration mapping is a hard threshold for minimizing classification error.
The proposed recursive merger approach is computationally efficient, with logarithmic time complexity.
The method applies to weighted and linear loss functions, maintaining the hard mapping property.
Abstract
In this work, we aim to calibrate the score outputs of an estimator for the binary classification problem by finding an 'optimal' mapping to class probabilities, where the 'optimal' mapping is in the sense that minimizes the classification error (or equivalently, maximizes the accuracy). We show that for the given target variables and the score outputs of an estimator, an 'optimal' soft mapping, which monotonically maps the score values to probabilities, is a hard mapping that maps the score values to and . We show that for class weighted (where the accuracy for one class is more important) and sample weighted (where the samples' accurate classifications are not equally important) errors, or even general linear losses; this hard mapping characteristic is preserved. We propose a sequential recursive merger approach, which produces an 'optimal' hard mapping (for the observed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Imbalanced Data Classification Techniques
