Conformal Validity Guarantees Exist for Any Data Distribution (and How   to Find Them)

Drew Prinster; Samuel Stanton; Anqi Liu; Suchi Saria

arXiv:2405.06627·cs.LG·June 6, 2024

Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)

Drew Prinster, Samuel Stanton, Anqi Liu, Suchi Saria

PDF

Open Access 1 Repo

TL;DR

This paper proves that conformal prediction guarantees can be extended to any data distribution, including those with sequential shifts, and provides practical algorithms for AI/ML applications.

Contribution

It introduces a theoretical extension of conformal prediction to all joint distributions and offers practical algorithms for handling sequential data shifts in AI/ML.

Findings

01

Conformal prediction validity extends beyond exchangeable data.

02

Practical algorithms derived for covariate shifts in AI/ML.

03

Empirical evaluation on synthetic optimization and active learning tasks.

Abstract

As artificial intelligence (AI) / machine learning (ML) gain widespread adoption, practitioners are increasingly seeking means to quantify and control the risk these systems incur. This challenge is especially salient when such systems have autonomy to collect their own data, such as in black-box optimization and active learning, where their actions induce sequential feedback-loop shifts in the data distribution. Conformal prediction is a promising approach to uncertainty and risk quantification, but prior variants' validity guarantees have assumed some form of ``quasi-exchangeability'' on the data distribution, thereby excluding many types of sequential shifts. In this paper we prove that conformal prediction can theoretically be extended to \textit{any} joint data distribution, not just exchangeable or quasi-exchangeable ones. Although the most general case is exceedingly impractical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

drewprinster/conformal-mfcs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Process Monitoring