Memorization-Dilation: Modeling Neural Collapse Under Label Noise

Duc Anh Nguyen, Ron Levie, Julian Lienen, Gitta Kutyniok, Eyke Hüllermeier

arXiv:2206.05530 · cs.LG · April 5, 2023 · 1 citation

TL;DR

This paper investigates how neural collapse phenomena are affected by label noise and memorization, proposing a realistic model that explains the regularization effects of label smoothing and the impact of noise on neural network representations.

Contribution

It introduces a memorization-dilation model that accounts for limited network expressivity and explains how different loss functions influence performance on noisy data.

Findings

01. Memorization of noisy data causes dilation of neural collapse.

02. Different loss functions lead to varying performance on noisy datasets.

03. Label smoothing acts as a regularizer improving generalization.
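
To make the label smoothing finding concrete, here is a minimal NumPy sketch of the standard technique: each one-hot target is mixed with the uniform distribution over classes before the cross-entropy is computed. The function names, the smoothing weight `alpha=0.1`, and the toy logits are assumptions chosen for illustration; they are not taken from the paper or its repository.

```python
import numpy as np

def smooth_labels(labels, num_classes, alpha=0.1):
    """Mix one-hot targets with the uniform distribution over classes.

    alpha is a hypothetical smoothing weight; alpha=0 gives plain one-hot targets.
    """
    one_hot = np.eye(num_classes)[labels]
    return (1.0 - alpha) * one_hot + alpha / num_classes

def cross_entropy(logits, targets):
    """Mean cross-entropy between softmax(logits) and (possibly soft) targets."""
    # Subtract the row-wise max for a numerically stable log-softmax.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-(targets * log_probs).sum(axis=1).mean())

# Toy batch: three examples, three classes, confident and correct logits.
labels = np.array([0, 2, 1])
logits = np.array([[4.0, 0.5, 0.1],
                   [0.2, 0.3, 3.5],
                   [0.1, 2.5, 0.4]])

soft_targets = smooth_labels(labels, num_classes=3, alpha=0.1)
hard_loss = cross_entropy(logits, smooth_labels(labels, 3, alpha=0.0))
smooth_loss = cross_entropy(logits, soft_targets)
print(hard_loss, smooth_loss)
```

With smoothed targets, confident predictions are penalized slightly even when they are correct, which is the sense in which label smoothing is usually described as a regularizer.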

Abstract

The notion of neural collapse refers to several emergent phenomena that have been empirically observed across various canonical classification problems. During the terminal phase of training a deep neural network, the feature embeddings of all examples of the same class tend to collapse to a single representation, and the features of different classes tend to separate as much as possible. Neural collapse is often studied through a simplified model, called the unconstrained feature representation, in which the model is assumed to have "infinite expressivity" and can map each data point to any arbitrary representation. In this work, we propose a more realistic variant of the unconstrained feature representation that takes the limited expressivity of the network into account. Empirical evidence suggests that the memorization of noisy data points leads to a degradation (dilation) of the…
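
The two collapse properties described in the abstract (same-class features concentrating on one point, class representations spreading apart) can be illustrated with a small NumPy sketch. It computes a within-class to between-class variability ratio and the mean pairwise cosine between centered class means. The function `collapse_metrics`, the synthetic features, and the noise scale are assumptions made for illustration; this is not necessarily the measure used in the paper.

```python
import numpy as np

def collapse_metrics(features, labels):
    """Return (within/between variability ratio, mean pairwise cosine of class means).

    A ratio near 0 means same-class features have collapsed to their class mean.
    For K maximally separated class means, the mean cosine approaches -1/(K-1).
    """
    classes = np.unique(labels)
    global_mean = features.mean(axis=0)
    means = np.stack([features[labels == c].mean(axis=0) for c in classes])

    # Average squared distance of each feature to its own class mean.
    within = np.mean([
        ((features[labels == c] - means[i]) ** 2).sum(axis=1).mean()
        for i, c in enumerate(classes)
    ])

    # Average squared distance of each class mean to the global mean.
    centered = means - global_mean
    between = (centered ** 2).sum(axis=1).mean()

    # Cosine similarity between distinct centered class means.
    unit = centered / np.linalg.norm(centered, axis=1, keepdims=True)
    cosines = unit @ unit.T
    off_diagonal = cosines[~np.eye(len(classes), dtype=bool)]
    return within / between, off_diagonal.mean()

# Synthetic, perfectly collapsed features: 3 classes, 20 examples each,
# with class means placed on a centered simplex in R^3 (hypothetical data).
num_classes, per_class = 3, 20
class_means = np.eye(num_classes) - 1.0 / num_classes
labels = np.repeat(np.arange(num_classes), per_class)
clean = class_means[labels]

# The same features with Gaussian perturbations, mimicking spread-out clusters.
rng = np.random.default_rng(0)
noisy = clean + 0.2 * rng.standard_normal(clean.shape)

ratio_clean, cos_clean = collapse_metrics(clean, labels)
ratio_noisy, cos_noisy = collapse_metrics(noisy, labels)
print(ratio_clean, cos_clean)
print(ratio_noisy, cos_noisy)
```

On the collapsed features the ratio is zero and the mean cosine equals -1/2, the value for three maximally separated directions; the perturbed features give a strictly larger ratio.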

Code & Models

Repositories

julilien/memorizationdilation (PyTorch, official)

Taxonomy

Topics: Neural Networks and Applications · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification