`Basic' Generalization Error Bounds for Least Squares Regression with Well-specified Models
Karthik Duraisamy

TL;DR
This paper derives non-asymptotic, quantitative generalization error bounds for well-specified linear least squares regression models, supported by analytical formulas and numerical experiments, providing accessible insights into their predictive performance.
Contribution
It offers new non-asymptotic bounds for the generalization error of well-specified linear regression, with explicit formulas and pedagogical derivations.
Findings
Bounds are non-asymptotic and quantitative.
Analytical formulas are validated by numerical experiments.
Provides accessible derivations for understanding generalization in linear regression.
Abstract
This note examines the behavior of generalization capabilities - as defined by out-of-sample mean squared error (MSE) - of Linear Gaussian (with a fixed design matrix) and Linear Least Squares regression. Particularly, we consider a well-specified model setting, i.e. we assume that there exists a `true' combination of model parameters within the chosen model form. While the statistical properties of Least Squares regression have been extensively studied over the past few decades - particularly with {\bf less restrictive problem statements} compared to the present work - this note targets bounds that are {\bf non-asymptotic and more quantitative} compared to the literature. Further, the analytical formulae for distributions and bounds (on the MSE) are directly compared to numerical experiments. Derivations are presented in a self-contained and pedagogical manner, in a way that a reader…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
