The Nuclear Route: Sharp Asymptotics of ERM in Overparameterized Quadratic Networks

Vittorio Erba; Emanuele Troiani; Lenka Zdeborová; Florent Krzakala

arXiv:2505.17958 · stat.ML · February 3, 2026

TL;DR

This paper analyzes the asymptotic behavior of overparameterized quadratic neural networks trained with ERM, revealing how low-rank structures influence generalization and capacity control in high dimensions.

Contribution

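It maps the $\ell_2$-regularized ERM problem to a convex matrix sensing task with nuclear norm penalization, which yields sharp asymptotics for the training and test errors and shows that capacity control in overparameterized quadratic networks comes from a low-rank structure in the learned feature maps.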

Findings

01 Characterizes global minima and generalization thresholds.

02 Shows capacity control emerges from low-rank feature maps.

03 Establishes a deep link between matrix sensing and neural network learning.

Abstract

We study the high-dimensional asymptotics of empirical risk minimization (ERM) in over-parametrized two-layer neural networks with quadratic activations trained on synthetic data. We derive sharp asymptotics for both training and test errors by mapping the $\ell_2$-regularized learning problem to a convex matrix sensing task with nuclear norm penalization. This reveals that capacity control in such networks emerges from a low-rank structure in the learned feature maps. Our results characterize the global minima of the loss and yield precise generalization thresholds, showing how the width of the target function governs learnability. This analysis bridges and extends ideas from spin-glass methods, matrix factorization, and convex optimization and emphasizes the deep link between low-rank matrix sensing and learning in quadratic neural networks.
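The mapping described in the abstract rests on two elementary identities, which the sketch below checks numerically. It assumes a simplified network with unit second-layer weights and no normalization, $f(x) = \sum_k (w_k \cdot x)^2$; the paper's exact scaling conventions may differ, and the function names here are illustrative, not taken from the authors' repository. With $S = W^\top W$, the network output is the quadratic form $x^\top S x$ (a matrix sensing measurement of $S$), and the $\ell_2$ penalty $\|W\|_F^2$ equals the nuclear norm of $S$ because $S$ is positive semidefinite.

```python
import numpy as np


def quadratic_network(W, x):
    """Simplified two-layer net with quadratic activation: sum_k (w_k . x)^2.

    Assumption: unit second-layer weights and no 1/sqrt(d) style scaling.
    """
    return float(np.sum((W @ x) ** 2))


def matrix_sensing_form(W, x):
    """Same output written as a measurement x^T S x of S = W^T W."""
    S = W.T @ W
    return float(x @ S @ x)


def l2_penalty(W):
    """Squared Frobenius norm of the first-layer weights."""
    return float(np.sum(W ** 2))


def nuclear_norm(S):
    """Sum of the singular values of S."""
    return float(np.linalg.norm(S, ord="nuc"))


# Overparameterized toy instance: width m larger than input dimension d.
rng = np.random.default_rng(0)
d, m = 5, 12
W = rng.standard_normal((m, d))
x = rng.standard_normal(d)

# The network output depends on W only through S = W^T W.
print(np.isclose(quadratic_network(W, x), matrix_sensing_form(W, x)))

# The l2 penalty on W is the nuclear norm of S, since S is PSD.
print(np.isclose(l2_penalty(W), nuclear_norm(W.T @ W)))
```

When the width is at least the input dimension, any positive semidefinite $S$ can be written as $W^\top W$, so optimizing over $W$ with an $\ell_2$ penalty can be read as optimizing over $S$ with a nuclear norm penalty. This is the sense in which a low-rank bias appears in this toy setting.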

Code & Models

Repositories

spoc-group/overparametrisednet
PyTorch · Official

Videos

The Nuclear Route: Sharp Asymptotics of ERM in Overparameterized Quadratic Networks · SlidesLive