Binary Classification with Bounded Abstention Rate
Shubhanshu Shekhar, Mohammad Ghavamzadeh, Tara Javidi

TL;DR
This paper studies binary classification with abstention under a bounded-rate constraint, providing a characterization of the optimal classifier, proposing a practical plug-in classifier, and demonstrating its near-optimality and empirical performance.
Contribution
It generalizes the characterization of the Bayes optimal classifier for bounded abstention and introduces a computationally efficient algorithm with theoretical guarantees.
Findings
The proposed classifier satisfies abstention constraints with high probability.
The algorithm achieves near-minimax optimal excess risk under standard assumptions.
Empirical results show competitive performance on UCI datasets.
Abstract
We consider the problem of binary classification with abstention in the relatively less studied \emph{bounded-rate} setting. We begin by obtaining a characterization of the Bayes optimal classifier for an arbitrary input-label distribution . Our result generalizes and provides an alternative proof for the result first obtained by \cite{chow1957optimum}, and then re-derived by \citet{denis2015consistency}, under a continuity assumption on . We then propose a plug-in classifier that employs unlabeled samples to decide the region of abstention and derive an upper-bound on the excess risk of our classifier under standard \emph{H\"older smoothness} and \emph{margin} assumptions. Unlike the plug-in rule of \citet{denis2015consistency}, our constructed classifier satisfies the abstention constraint with high probability and can also deal with discontinuities in the empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
