All of Linear Regression
Arun K. Kuchibhotla, Lawrence D. Brown, Andreas Buja, Junhui Cai

TL;DR
This paper provides a unified, finite-sample analysis of ordinary least squares (OLS) regression, addressing convergence, dependence, variable selection, and inference in high-dimensional and potentially dependent data settings.
Contribution
It introduces a simple deterministic inequality that underpins finite-sample results for OLS under various conditions, including high-dimensional and dependent data, and studies inference after variable selection.
Findings
Finite-sample bounds for OLS convergence.
Analysis of inference after variable selection with increasing covariates.
First study of inference post-variable selection in high-dimensional settings.
Abstract
Least squares linear regression is one of the oldest and widely used data analysis tools. Although the theoretical analysis of the ordinary least squares (OLS) estimator is as old, several fundamental questions are yet to be answered. Suppose regression observations (not necessarily independent) are available. Some of the questions we deal with are as follows: under what conditions, does the OLS estimator converge and what is the limit? What happens if the dimension is allowed to grow with ? What happens if the observations are dependent with dependence possibly strengthening with ? How to do statistical inference under these kinds of misspecification? What happens to the OLS estimator under variable selection? How to do inference under misspecification and variable selection? We answer all the questions raised above…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Sparse and Compressive Sensing Techniques
