Distribution-free inference for regression: discrete, continuous, and in between
Yonghoon Lee, Rina Foygel Barber

TL;DR
This paper explores distribution-free inference for regression across different types of feature distributions, identifying regimes where confidence intervals can have vanishing width based on the support size relative to sample size.
Contribution
It characterizes the regimes between finite and continuous feature distributions where distribution-free confidence intervals with vanishing width are possible.
Findings
Vanishing-width confidence intervals are achievable when the effective support size is less than the square of the sample size.
In continuous feature settings, confidence intervals must have non-vanishing width.
The paper delineates distinct regimes based on the support size of the feature distribution.
Abstract
In data analysis problems where we are not able to rely on distributional assumptions, what types of inference guarantees can still be obtained? Many popular methods, such as holdout methods, cross-validation methods, and conformal prediction, are able to provide distribution-free guarantees for predictive inference, but the problem of providing inference for the underlying regression function (for example, inference on the conditional mean ) is more challenging. In the setting where the features are continuously distributed, recent work has established that any confidence interval for must have non-vanishing width, even as sample size tends to infinity. At the other extreme, if takes only a small number of possible values, then inference on is trivial to achieve. In this work, we study the problem in settings in between these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Markov Chains and Monte Carlo Methods
