Characterizing the Training-Conditional Coverage of Full Conformal Inference in High Dimensions
Isaac Gibbs, Emmanuel J. Cand\`es

TL;DR
This paper analyzes the coverage properties of full conformal inference in high-dimensional linear regression, showing it concentrates at the target coverage level conditionally on training data, and compares it with simpler methods.
Contribution
It provides the first theoretical analysis of training-conditional coverage of full conformal inference in high dimensions, demonstrating its necessity and limitations.
Findings
Full conformal coverage concentrates at the target level in high dimensions.
Simple residual-based methods exhibit undercoverage bias.
Full conformal remains valid for tuning regularization without test data.
Abstract
We study the coverage properties of full conformal regression in the proportional asymptotic regime where the ratio of the dimension and the sample size converges to a constant. In this setting, existing theory tells us only that full conformal inference is unbiased, in the sense that its average coverage lies at the desired level when marginalized over both the new test point and the training data. Considerably less is known about the behaviour of these methods conditional on the training set. As a result, the exact benefits of full conformal inference over much simpler alternative methods is unclear. This paper investigates the behaviour of full conformal inference and natural uncorrected alternatives for a broad class of -regularized linear regression models. We show that in the proportional asymptotic regime the training-conditional coverage of full conformal inference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Advanced Statistical Methods and Models
