Bootstrapping and Sample Splitting For High-Dimensional, Assumption-Free Inference
Alessandro Rinaldo, Larry Wasserman, Max G'Sell, Jing Lei

TL;DR
This paper revisits sample splitting combined with bootstrap methods to provide a simple, assumption-free approach for high-dimensional inference, establishing new accuracy bounds and introducing improved variable importance measures.
Contribution
It introduces a bootstrap-based, assumption-free inference method for high-dimensional models, with new accuracy bounds and improved variable importance parameters.
Findings
Bootstrap and Normal approximation bounds for nonlinear parameters
Image bootstrap offers higher coverage accuracy
Splitting enhances inference robustness but may reduce prediction accuracy
Abstract
Several new methods have been proposed for performing valid inference after model selection. An older method is sampling splitting: use part of the data for model selection and part for inference. In this paper we revisit sample splitting combined with the bootstrap (or the Normal approximation). We show that this leads to a simple, assumption-free approach to inference and we establish results on the accuracy of the method. In fact, we find new bounds on the accuracy of the bootstrap and the Normal approximation for general nonlinear parameters with increasing dimension which we then use to assess the accuracy of regression inference. We show that an alternative, called the image bootstrap, has higher coverage accuracy at the cost of more computation. We define new parameters that measure variable importance and that can be inferred with greater accuracy than the usual regression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Fault Detection and Control Systems · Gaussian Processes and Bayesian Inference
