`Basic' Generalization Error Bounds for Least Squares Regression with   Well-specified Models

Karthik Duraisamy

arXiv:2109.09647·math.ST·September 21, 2021

`Basic' Generalization Error Bounds for Least Squares Regression with Well-specified Models

Karthik Duraisamy

PDF

Open Access

TL;DR

This paper derives non-asymptotic, quantitative generalization error bounds for well-specified linear least squares regression models, supported by analytical formulas and numerical experiments, providing accessible insights into their predictive performance.

Contribution

It offers new non-asymptotic bounds for the generalization error of well-specified linear regression, with explicit formulas and pedagogical derivations.

Findings

01

Bounds are non-asymptotic and quantitative.

02

Analytical formulas are validated by numerical experiments.

03

Provides accessible derivations for understanding generalization in linear regression.

Abstract

This note examines the behavior of generalization capabilities - as defined by out-of-sample mean squared error (MSE) - of Linear Gaussian (with a fixed design matrix) and Linear Least Squares regression. Particularly, we consider a well-specified model setting, i.e. we assume that there exists a `true' combination of model parameters within the chosen model form. While the statistical properties of Least Squares regression have been extensively studied over the past few decades - particularly with {\bf less restrictive problem statements} compared to the present work - this note targets bounds that are {\bf non-asymptotic and more quantitative} compared to the literature. Further, the analytical formulae for distributions and bounds (on the MSE) are directly compared to numerical experiments. Derivations are presented in a self-contained and pedagogical manner, in a way that a reader…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference