Neural Collapse Beyond the Unconstrained Features Model: Landscape,   Dynamics, and Generalization in the Mean-Field Regime

Diyuan Wu; Marco Mondelli

arXiv:2501.19104·cs.LG·February 5, 2025

Neural Collapse Beyond the Unconstrained Features Model: Landscape, Dynamics, and Generalization in the Mean-Field Regime

Diyuan Wu, Marco Mondelli

PDF

Open Access 1 Video

TL;DR

This paper investigates the neural collapse phenomenon, specifically NC1, in a data-specific setting with a three-layer neural network in the mean-field regime, linking it to the loss landscape and training dynamics.

Contribution

It establishes a connection between NC1 and the loss landscape, showing that gradient flow leads to NC1 solutions with small empirical loss and test error for certain data distributions.

Findings

01

Points with small loss and gradient norm approximately satisfy NC1

02

Gradient flow converges to NC1 solutions with low empirical loss

03

NC1 and low test error co-occur for well-separated data distributions

Abstract

Neural Collapse is a phenomenon where the last-layer representations of a well-trained neural network converge to a highly structured geometry. In this paper, we focus on its first (and most basic) property, known as NC1: the within-class variability vanishes. While prior theoretical studies establish the occurrence of NC1 via the data-agnostic unconstrained features model, our work adopts a data-specific perspective, analyzing NC1 in a three-layer neural network, with the first two layers operating in the mean-field regime and followed by a linear layer. In particular, we establish a fundamental connection between NC1 and the loss landscape: we prove that points with small empirical loss and gradient norm (thus, close to being stationary) approximately satisfy NC1, and the closeness to NC1 is controlled by the residual loss and gradient norm. We then show that (i) gradient flow on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Neural Collapse Beyond the Unconstrained Features Model: Landscape, Dynamics, and Generalization in the Mean-Field Regime· slideslive

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus