TL;DR
This paper introduces a distribution-free method for generating prediction sets with finite-sample guarantees, enabling reliable uncertainty quantification across diverse machine learning tasks without distributional assumptions.
Contribution
It presents a novel calibration framework for set-valued predictions that control expected loss at a specified level, applicable to various complex prediction problems.
Findings
Provides finite-sample guarantees for prediction sets in multiple tasks
Demonstrates effectiveness on large-scale classification, multi-label, hierarchical, segmentation, and protein prediction
Enables distribution-free, rigorous error control in practical settings
Abstract
While improving prediction accuracy has been the focus of machine learning in recent years, this alone does not suffice for reliable decision-making. Deploying learning systems in consequential settings also requires calibrating and communicating the uncertainty of predictions. To convey instance-wise uncertainty for prediction tasks, we show how to generate set-valued predictions from a black-box predictor that control the expected loss on future test points at a user-specified level. Our approach provides explicit finite-sample guarantees for any dataset by using a holdout set to calibrate the size of the prediction sets. This framework enables simple, distribution-free, rigorous error control for many tasks, and we demonstrate it in five large-scale machine learning problems: (1) classification problems where some mistakes are more costly than others; (2) multi-label classification,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Distribution-Free, Risk-Controlling Prediction Sets· youtube
