Invariance-based Inference in High-Dimensional Regression with Finite-Sample Guarantees
Wenxuan Guo, Panos Toulis

TL;DR
This paper introduces simple, invariance-based randomization tests for high-dimensional regression that provide finite-sample validity and power guarantees, with optimality under Gaussian design, and demonstrate robustness in simulations and real data.
Contribution
It develops novel invariance-based inference procedures with finite-sample guarantees for high-dimensional regression, including global and partial null hypothesis testing, and shows their optimality and robustness.
Findings
Finite-sample valid tests for global null hypothesis.
Residual-based tests for partial null hypotheses.
Robustness to multicollinearity and heavy-tailed covariates.
Abstract
In this paper, we develop invariance-based procedures for testing and inference in high-dimensional regression models. These procedures, also known as randomization tests, provide several important advantages. First, for the global null hypothesis of significance, our test is valid in finite samples. It is also simple to implement and comes with finite-sample guarantees on statistical power. Remarkably, despite its simplicity, this testing idea has escaped the attention of earlier analytical work, which mainly concentrated on complex high-dimensional asymptotic methods. Under an additional assumption of Gaussian design, we show that this test also achieves the minimax optimal rate against certain nonsparse alternatives, a type of result that is rare in the literature. Second, for partial null hypotheses, we propose residual-based tests and derive theoretical conditions for their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
