# Statistical inference with F-statistics when fitting simple models to   high-dimensional data

**Authors:** Hannes Leeb, Lukas Steinberger

arXiv: 1902.04304 · 2019-02-13

## TL;DR

This paper investigates the validity of F-tests in high-dimensional linear models where the number of predictors exceeds the number of observations, showing asymptotic correctness even under model misspecification.

## Contribution

It provides theoretical results demonstrating the asymptotic validity of F-tests for simple linear models in high-dimensional settings, despite potential misspecification.

## Key findings

- F-test remains valid asymptotically in high-dimensional regimes
- Validity holds even when the simple model is misspecified
- Results applicable to models with many more predictors than observations

## Abstract

We study linear subset regression in the context of the high-dimensional overall model $y = \vartheta+\theta' z + \epsilon$ with univariate response $y$ and a $d$-vector of random regressors $z$, independent of $\epsilon$. Here, "high-dimensional" means that the number $d$ of available explanatory variables is much larger than the number $n$ of observations. We consider simple linear sub-models where $y$ is regressed on a set of $p$ regressors given by $x = M'z$, for some $d \times p$ matrix $M$ of full rank $p < n$. The corresponding simple model, i.e., $y=\alpha+\beta' x + e$, can be justified by imposing appropriate restrictions on the unknown parameter $\theta$ in the overall model; otherwise, this simple model can be grossly misspecified. In this paper, we establish asymptotic validity of the standard $F$-test on the surrogate parameter $\beta$, in an appropriate sense, even when the simple model is misspecified.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.04304/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1902.04304/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1902.04304/full.md

---
Source: https://tomesphere.com/paper/1902.04304