The $\varphi$ Curve: The Shape of Generalization through the Lens of Norm-based Capacity Control
Yichen Wang, Yudong Chen, Lorenzo Rosasco, Fanghui Liu

TL;DR
This paper investigates how norm-based capacity measures explain the shape of learning curves in over-parameterized models, revealing a phase transition without double descent, contrasting classical theories.
Contribution
It provides a precise analysis of norm concentration in random features estimators and demonstrates the phase transition in test error without double descent behavior.
Findings
Learning curves exhibit a phase transition from under- to over-parameterization.
Classical U-shaped test error behavior is recovered with norm-based capacity measures.
No double descent behavior is observed in the studied models.
Abstract
Understanding how the test risk scales with model complexity is a central question in machine learning. Classical theory is challenged by the learning curves observed for large over-parametrized deep networks. Capacity measures based on parameter count typically fail to account for these empirical observations. To tackle this challenge, we consider norm-based capacity measures and develop our study for random features based estimators, widely used as simplified theoretical models for more complex networks. In this context, we provide a precise characterization of how the estimator's norm concentrates and how it governs the associated test error. Our results show that the predicted learning curve admits a phase transition from under- to over-parameterization, but no double descent behavior. This confirms that more classical U-shaped behavior is recovered considering appropriate capacity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Statistical Mechanics and Entropy
MethodsFocus
