The relative effects of dimensionality and multiplicity of hypotheses on the F-test in linear regression
Lukas Steinberger

TL;DR
This paper investigates how the number of hypotheses and regressors affects the power of the F-test in high-dimensional linear regression, revealing that fewer hypotheses mitigate the negative impact of many regressors.
Contribution
It extends previous work by analyzing the effect of multiple hypotheses and non-Gaussian regressors on the F-test's power in high-dimensional settings.
Findings
Power decreases as the number of regressors increases.
Testing fewer hypotheses reduces the negative impact of high dimensionality.
Results are generalized to non-Gaussian regressors using recent covariance matrix theory.
Abstract
Recently, several authors have re-examined the power of the classical F-test in linear regression in a `large-p, large-n' framework (cf. Zhong and Chen (2011), Wang and Cui (2013)). They highlight the loss of power as the number of regressors p increases relative to sample size n. These papers essentially focus only on the overall test of the null hypothesis that all p slope coefficients are equal to zero. Here, we consider the general case of testing q linear hypotheses on the (p+1)-dimensional regression parameter vector that includes p slope coefficients and an intercept parameter. In the case of Gaussian design, we describe the dependence of the local asymptotic power function on both the relative number of parameters p and the number of hypotheses q being tested, showing that the negative effect of dimensionality is less severe if the number of hypotheses is small. Using the recent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
