Random Features Model with General Convex Regularization: A Fine Grained Analysis with Precise Asymptotic Learning Curves
David Bosch, Ashkan Panahi, Ayca \"Ozcelikkale, Devdatt Dubhash

TL;DR
This paper derives precise asymptotic learning curves for random feature models with convex regularization, providing a computable scalar optimization and extending universality results to $ ext{l}_1$ regularization, with practical numerical validation.
Contribution
It introduces a novel multi-level CGMT approach for explicit asymptotic analysis of RF models with correlated data and extends universality to $ ext{l}_1$ regularization.
Findings
Derived explicit asymptotic expressions for learning curves.
Extended universality results to $ ext{l}_1$ regularization.
Numerical validation shows accurate test error predictions.
Abstract
We compute precise asymptotic expressions for the learning curves of least squares random feature (RF) models with either a separable strongly convex regularization or the regularization. We propose a novel multi-level application of the convex Gaussian min max theorem (CGMT) to overcome the traditional difficulty of finding computable expressions for random features models with correlated data. Our result takes the form of a computable 4-dimensional scalar optimization. In contrast to previous results, our approach does not require solving an often intractable proximal operator, which scales with the number of model parameters. Furthermore, we extend the universality results for the training and generalization errors for RF models to regularization. In particular, we demonstrate that under mild conditions, random feature models with elastic net or …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
