Hypothesis Testing in High-Dimensional Regression under the Gaussian Random Design Model: Asymptotic Theory
Adel Javanmard, Andrea Montanari

TL;DR
This paper develops a theoretical framework for computing p-values in high-dimensional linear regression using the Lasso, providing asymptotic distributional results and nearly optimal testing procedures for Gaussian random design matrices.
Contribution
It introduces a debiasing approach for the Lasso estimator to accurately assess statistical significance and characterizes the asymptotic distribution under Gaussian designs.
Findings
Derived upper bounds on the minimax power of tests at given significance levels.
Proposed a practical procedure nearly achieving the theoretical power bounds.
Established the standard distributional limit for Gaussian design matrices with large sample sizes.
Abstract
We consider linear regression in the high-dimensional regime where the number of observations is smaller than the number of parameters . A very successful approach in this setting uses -penalized least squares (a.k.a. the Lasso) to search for a subset of parameters that best explain the data, while setting the other parameters to zero. Considerable amount of work has been devoted to characterizing the estimation and model selection problems within this approach. In this paper we consider instead the fundamental, but far less understood, question of \emph{statistical significance}. More precisely, we address the problem of computing p-values for single regression coefficients. On one hand, we develop a general upper bound on the minimax power of tests with a given significance level. On the other, we prove that this upper bound is (nearly) achievable through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Regression
