TL;DR
This paper analyzes the predictive risk of ridge regression and regularized discriminant analysis in high-dimensional settings, providing explicit formulas that depend on the covariance spectrum and revealing nuanced effects of covariance structure.
Contribution
It offers a unified, explicit characterization of predictive risk for both methods in high dimensions, accounting for arbitrary covariance structures and connecting risk measures.
Findings
Predictive risk depends on the spectrum of the covariance matrix.
There is an inverse relation between predictive risk and estimation risk for ridge regression.
Eigenvalue distribution significantly influences classification accuracy.
Abstract
We provide a unified analysis of the predictive risk of ridge regression and regularized discriminant analysis in a dense random effects model. We work in a high-dimensional asymptotic regime where and , and allow for arbitrary covariance among the features. For both methods, we provide an explicit and efficiently computable expression for the limiting predictive risk, which depends only on the spectrum of the feature-covariance matrix, the signal strength, and the aspect ratio . Especially in the case of regularized discriminant analysis, we find that predictive accuracy has a nuanced dependence on the eigenvalue distribution of the covariance matrix, suggesting that analyses based on the operator norm of the covariance matrix may not be sharp. Our results also uncover several qualitative insights about both methods: for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
