Powerful batch conformal prediction for classification
Ulysse Gazin, Ruth Heller, Etienne Roquain, Aldo Solari

TL;DR
This paper introduces a powerful conformal prediction method for classification that constructs valid batch prediction sets with improved efficiency by leveraging conformal p-values and the Simes inequality, applicable even under label shift.
Contribution
It proposes a novel approach for batch conformal prediction using conformal p-value combinations, offering increased power over Bonferroni correction and providing theoretical guarantees under various data assumptions.
Findings
The proposed method yields narrower prediction sets compared to Bonferroni correction.
The approach remains valid under label distribution shift.
Empirical results demonstrate improved performance on synthetic and real data.
Abstract
In a split conformal framework with classes, a calibration sample of labeled examples is observed for inference on the label of a new unlabeled example. We explore the setting where a `batch' of independent such unlabeled examples is given, and the goal is to construct a batch prediction set with 1- coverage. Unlike individual prediction sets, the batch prediction set is a collection of label vectors of size , while the calibration sample consists of univariate labels. A natural approach is to apply the Bonferroni correction, which concatenates individual prediction sets at level . We propose a uniformly more powerful solution, based on specific combinations of conformal -values that exploit the Simes inequality. We provide a general recipe for valid inference with any combinations of conformal -values, and compare the performance of several…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Data Stream Mining Techniques
MethodsSparse Evolutionary Training
