# A New Look at an Old Problem: A Universal Learning Approach to Linear   Regression

**Authors:** Koby Bibas, Yaniv Fogel, Meir Feder

arXiv: 1905.04708 · 2019-11-11

## TL;DR

This paper applies universal learning principles to linear regression, providing analytical solutions and insights into generalization capabilities, especially in over-parameterized settings where the number of features exceeds the training samples.

## Contribution

It introduces an analytical pNML solution for universal learning in linear regression, including cases with more parameters than data points, and explores conditions for successful generalization.

## Key findings

- Learnability depends on the test vector’s subspace alignment.
- Linear regression can generalize in over-parameterized models under certain conditions.
- Simulation demonstrates polynomial fitting with high-degree models.

## Abstract

Linear regression is a classical paradigm in statistics. A new look at it is provided via the lens of universal learning. In applying universal learning to linear regression the hypotheses class represents the label $y\in {\cal R}$ as a linear combination of the feature vector $x^T\theta$ where $x\in {\cal R}^M$, within a Gaussian error. The Predictive Normalized Maximum Likelihood (pNML) solution for universal learning of individual data can be expressed analytically in this case, as well as its associated learnability measure. Interestingly, the situation where the number of parameters $M$ may even be larger than the number of training samples $N$ can be examined. As expected, in this case learnability cannot be attained in every situation; nevertheless, if the test vector resides mostly in a subspace spanned by the eigenvectors associated with the large eigenvalues of the empirical correlation matrix of the training data, linear regression can generalize despite the fact that it uses an ``over-parametrized'' model. We demonstrate the results with a simulation of fitting a polynomial to data with a possibly large polynomial degree.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.04708/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1905.04708/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/1905.04708/full.md

---
Source: https://tomesphere.com/paper/1905.04708