TL;DR
This paper analyzes the predictive risk of ridge regression in high-dimensional, non-i.i.d. data with a variance profile, revealing phenomena like double descent and providing deterministic risk equivalents.
Contribution
It introduces a novel high-dimensional analysis of ridge regression with variance profiles, extending random matrix theory tools to non-i.i.d. data.
Findings
Deterministic equivalents for predictive risk and degrees of freedom are derived.
Double descent phenomenon appears under certain variance profiles.
Risk shape varies with different variance profiles, sometimes deviating from double descent.
Abstract
High-dimensional linear regression has been thoroughly studied in the context of independent and identically distributed data. We propose to investigate high-dimensional regression models for independent but non-identically distributed data. To this end, we suppose that the set of observed predictors (or features) is a random matrix with a variance profile and with dimensions growing at a proportional rate. Assuming a random effect model, we study the predictive risk of the ridge estimator for linear regression with such a variance profile. In this setting, we provide deterministic equivalents of this risk and of the degree of freedom of the ridge estimator. For certain class of variance profile, our work highlights the emergence of the well-known double descent phenomenon in high-dimensional regression for the minimum norm least-squares estimator when the ridge regularization parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Random Matrices and Applications · Stochastic Gradient Optimization Techniques
MethodsSparse Evolutionary Training · Linear Regression
