Distribution-Free, Risk-Controlling Prediction Sets

Stephen Bates; Anastasios Angelopoulos; Lihua Lei; Jitendra; Malik; Michael I. Jordan

arXiv:2101.02703·cs.LG·August 6, 2021

Distribution-Free, Risk-Controlling Prediction Sets

Stephen Bates, Anastasios Angelopoulos, Lihua Lei, Jitendra, Malik, Michael I. Jordan

PDF

3 Repos 1 Video

TL;DR

This paper introduces a distribution-free method for generating prediction sets with finite-sample guarantees, enabling reliable uncertainty quantification across diverse machine learning tasks without distributional assumptions.

Contribution

It presents a novel calibration framework for set-valued predictions that control expected loss at a specified level, applicable to various complex prediction problems.

Findings

01

Provides finite-sample guarantees for prediction sets in multiple tasks

02

Demonstrates effectiveness on large-scale classification, multi-label, hierarchical, segmentation, and protein prediction

03

Enables distribution-free, rigorous error control in practical settings

Abstract

While improving prediction accuracy has been the focus of machine learning in recent years, this alone does not suffice for reliable decision-making. Deploying learning systems in consequential settings also requires calibrating and communicating the uncertainty of predictions. To convey instance-wise uncertainty for prediction tasks, we show how to generate set-valued predictions from a black-box predictor that control the expected loss on future test points at a user-specified level. Our approach provides explicit finite-sample guarantees for any dataset by using a holdout set to calibrate the size of the prediction sets. This framework enables simple, distribution-free, rigorous error control for many tasks, and we demonstrate it in five large-scale machine learning problems: (1) classification problems where some mistakes are more costly than others; (2) multi-label classification,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Distribution-Free, Risk-Controlling Prediction Sets· youtube