Neural Collapse by Design: Learning Class Prototypes on the Hypersphere

Panagiotis Koromilas; Theodoros Giannakopoulos; Mihalis A. Nicolaou; Yannis Panagakis

arXiv:2605.20302·cs.LG·May 22, 2026

Neural Collapse by Design: Learning Class Prototypes on the Hypersphere

Panagiotis Koromilas, Theodoros Giannakopoulos, Mihalis A. Nicolaou, Yannis Panagakis

PDF

TL;DR

This paper introduces normalized contrastive losses and a prototype-based framework to achieve Neural Collapse in supervised learning, improving accuracy, transferability, and robustness.

Contribution

It proposes NTCE and NONL losses that fix the limitations of CE and SCL, enabling supervised learning to reach Neural Collapse by design.

Findings

01

NTCE and NONL surpass CE accuracy on benchmarks including ImageNet-1K.

02

They closely approximate Neural Collapse (≥95%) in fewer iterations.

03

Improved transfer learning and robustness to corruptions demonstrated.

Abstract

Supervised classification has a theoretical optimum, Neural Collapse (NC), yet neither of its two dominant paradigms reaches it in practice. Cross entropy (CE) leaves radial degrees of freedom unconstrained and converges to a degenerate geometry, while supervised contrastive learning (SCL) drives features toward NC during pretraining but discards this structure in a post hoc linear probing phase. We show that both paradigms are different appearances of the same method that contrasts prototypes on the unit hypersphere, and that closing the gap requires fixing each at its point of failure. From the CE side, we propose NTCE and NONL, two normalized losses that import contrastive optimization's missing ingredients into classifier learning: a large effective negative set and decoupled alignment and uniformity terms. From the SCL side, we prove that SCL's objective already optimizes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.