On Optimal Interpolation In Linear Regression
Eduard Oravkin, Patrick Rebeschini

TL;DR
This paper derives the optimal linear response-interpolating estimator in linear regression, showing how it can outperform the minimum-norm interpolator and extending the analysis to random features regression.
Contribution
It provides a closed-form expression for the optimal interpolator in linear regression and connects it to gradient descent limits, advancing understanding of interpolation and generalization.
Findings
Optimal interpolator derived in closed form.
Minimum-norm interpolator can generalize worse than the optimal.
Extension of optimal interpolation concept to random features regression.
Abstract
Understanding when and why interpolating methods generalize well has recently been a topic of interest in statistical learning theory. However, systematically connecting interpolating methods to achievable notions of optimality has only received partial attention. In this paper, we investigate the question of what is the optimal way to interpolate in linear regression using functions that are linear in the response variable (as the case for the Bayes optimal estimator in ridge regression) and depend on the data, the population covariance of the data, the signal-to-noise ratio and the covariance of the prior for the signal, but do not depend on the value of the signal itself nor the noise vector in the training data. We provide a closed-form expression for the interpolator that achieves this notion of optimality and show that it can be derived as the limit of preconditioned gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference
MethodsLinear Regression
