Testing linear hypotheses in high-dimensional regressions
Z. Bai, D. Jiang, J. Yao, S. Zheng

TL;DR
This paper develops a high-dimensional correction to Wilk's likelihood ratio test for multivariate linear models, enabling accurate hypothesis testing when both data dimension and sample size are large.
Contribution
The authors propose a novel asymptotic correction based on random matrix theory for Wilk's test in high-dimensional settings, improving its accuracy and applicability.
Findings
Corrected LRT is asymptotically Gaussian under the null hypothesis.
Simulation results show the corrected test maintains size and power for large p and n.
Standard chi-square approximation fails in high-dimensional contexts.
Abstract
For a multivariate linear model, Wilk's likelihood ratio test (LRT) constitutes one of the cornerstone tools. However, the computation of its quantiles under the null or the alternative requires complex analytic approximations and more importantly, these distributional approximations are feasible only for moderate dimension of the dependent variable, say . On the other hand, assuming that the data dimension as well as the number of regression variables are fixed while the sample size grows, several asymptotic approximations are proposed in the literature for Wilk's including the widely used chi-square approximation. In this paper, we consider necessary modifications to Wilk's test in a high-dimensional context, specifically assuming a high data dimension and a large sample size . Based on recent random matrix theory, the correction we propose to Wilk's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
