Predictive Model Degrees of Freedom in Linear Regression

Bo Luan; Yoonkyung Lee; Yunzhang Zhu

arXiv:2106.15682·math.ST·July 4, 2022·1 cites

Predictive Model Degrees of Freedom in Linear Regression

Bo Luan, Yoonkyung Lee, Yunzhang Zhu

PDF

Open Access

TL;DR

This paper introduces a new measure of model complexity for linear regression that explains the success of overparametrized models and reconciles the double descent phenomenon with classical theory.

Contribution

It proposes an adjusted degrees of freedom measure based on covariance, extending classical concepts to modern interpolating models.

Findings

01

The new measure differentiates among interpolating models.

02

It explains the double descent phenomenon.

03

It extends classical model complexity theory.

Abstract

Overparametrized interpolating models have drawn increasing attention from machine learning. Some recent studies suggest that regularized interpolating models can generalize well. This phenomenon seemingly contradicts the conventional wisdom that interpolation tends to overfit the data and performs poorly on test data. Further, it appears to defy the bias-variance trade-off. As one of the shortcomings of the existing theory, the classical notion of model degrees of freedom fails to explain the intrinsic difference among the interpolating models since it focuses on estimation of in-sample prediction error. This motivates an alternative measure of model complexity which can differentiate those interpolating models and take different test points into account. In particular, we propose a measure with a proper adjustment based on the squared covariance between the predictions and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques