Valid inferential models for prediction in supervised learning problems

Leonardo Cella; Ryan Martin

arXiv:2112.10234·math.ST·November 22, 2022·Int. J. Approx. Reason.

Valid inferential models for prediction in supervised learning problems

Leonardo Cella, Ryan Martin

PDF

TL;DR

This paper advocates for the use of valid probabilistic predictors in supervised learning, which provide reliable uncertainty quantification and connect to conformal prediction, enhancing frequentist error control.

Contribution

It introduces a formal notion of validity for probabilistic predictors, demonstrates their imprecision, and offers a general construction linked to conformal prediction for regression and classification.

Findings

01

Valid probabilistic predictors must be imprecise

02

They avoid sure loss and ensure error rate control

03

The construction is applicable to regression and classification

Abstract

Prediction, where observed data is used to quantify uncertainty about a future observation, is a fundamental problem in statistics. Prediction sets with coverage probability guarantees are a common solution, but these do not provide probabilistic uncertainty quantification in the sense of assigning beliefs to relevant assertions about the future observable. Alternatively, we recommend the use of a {\em probabilistic predictor}, a data-dependent (imprecise) probability distribution for the to-be-predicted observation given the observed data. It is essential that the probabilistic predictor be reliable or valid, and here we offer a notion of validity and explore its behavioral and statistical implications. In particular, we show that valid probabilistic predictors must be imprecise, that they avoid sure loss, and that they lead to prediction procedures with desirable frequentist error…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.