Prediction and outlier detection in classification problems
Leying Guan, Rob Tibshirani

TL;DR
This paper introduces BCOPS, a method combining conformal prediction and supervised learning to create prediction sets that handle distribution shifts and detect outliers in multi-class classification, with theoretical guarantees.
Contribution
The paper proposes BCOPS, a novel approach that optimizes prediction sets for out-of-sample performance and outlier detection, with proven asymptotic consistency and finite-sample coverage guarantees.
Findings
BCOPS effectively detects outliers and maintains coverage.
The method demonstrates asymptotic optimality.
Real data examples validate the approach.
Abstract
We consider the multi-class classification problem when the training data and the out-of-sample test data may have different distributions and propose a method called BCOPS (balanced and conformal optimized prediction sets). BCOPS constructs a prediction set as a subset of class labels, possibly empty. It tries to optimize the out-of-sample performance, aiming to include the correct class as often as possible, but also detecting outliers , for which the method returns no prediction (corresponding to equal to the empty set). The proposed method combines supervised-learning algorithms with the method of conformal prediction to minimize a misclassification loss averaged over the out-of-sample distribution. The constructed prediction sets have a finite-sample coverage guarantee without distributional assumptions. We also propose a method to estimate the outlier detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Anomaly Detection Techniques and Applications · Advanced Statistical Process Monitoring
