Algebraic and Statistical Properties of the Partially Regularized Ordinary Least Squares Interpolator
Letian Yang, Dennis Shen

TL;DR
This paper investigates the algebraic and statistical properties of partially regularized OLS interpolators in high-dimensional linear regression, extending classical formulas and developing variance estimators for inference.
Contribution
It extends Cochran's and leave-one-out formulas to partial regularization, and designs variance estimators for statistical inference in high-dimensional settings.
Findings
Extended classical formulas for partial regularization
Developed variance estimators for inference
Validated estimators through simulations
Abstract
Modern deep learning has revealed a surprising statistical phenomenon known as benign overfitting, with high-dimensional linear regression being a prominent example. This paper contributes to ongoing research on the ordinary least squares (OLS) interpolator, focusing on the partial regression setting, where only a subset of coefficients is implicitly regularized. On the algebraic front, we extend Cochran's formula and the leave-one-out residual formula for the partial regularization framework. On the stochastic front, we leverage our algebraic results to design several homoskedastic variance estimators under the Gauss-Markov model. These estimators serve as a basis for conducting statistical inference, albeit with slight conservatism in their performance. Through simulations, we study the finite-sample properties of these variance estimators across various generative models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and numerical algorithms
