Conformal Prediction with Missing Values
Margaux Zaffran, Aymeric Dieuleveut, Julie Josse, Yaniv Romano

TL;DR
This paper extends conformal prediction to handle missing data in covariates, proposing a new framework that ensures valid prediction intervals conditioned on missing data patterns, with theoretical guarantees and empirical validation.
Contribution
It introduces a generalized conformalized quantile regression framework with missing data augmentation, providing conditional coverage guarantees despite complex missingness patterns.
Findings
Conditional coverage is maintained on imputed data.
The proposed method outperforms existing approaches in experiments.
The framework is effective on synthetic and real critical care data.
Abstract
Conformal prediction is a theoretically grounded framework for constructing predictive intervals. We study conformal prediction with missing values in the covariates -- a setting that brings new challenges to uncertainty quantification. We first show that the marginal coverage guarantee of conformal prediction holds on imputed data for any missingness distribution and almost all imputation functions. However, we emphasize that the average coverage varies depending on the pattern of missing values: conformal methods tend to construct prediction intervals that under-cover the response conditionally to some missing patterns. This motivates our novel generalized conformalized quantile regression framework, missing data augmentation, which yields prediction intervals that are valid conditionally to the patterns of missing values, despite their exponential number. We then show that a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference
