Geometric Analysis of Unconstrained Feature Models with $d=K$

Yi Shen; Shao Gu

arXiv:2407.10702·cs.LG·July 23, 2024

Geometric Analysis of Unconstrained Feature Models with $d=K$

Yi Shen, Shao Gu

PDF

Open Access

TL;DR

This paper analyzes unconstrained feature models in deep learning, proving that all critical points are either global minima or strict saddles when feature dimension equals number of classes, confirming prior conjectures.

Contribution

It rigorously proves that two popular unconstrained feature models are strict saddle functions with no suboptimal local minima, confirming previous conjectures.

Findings

01

All critical points are either global minima or strict saddles.

02

The models are strict saddle functions with negative curvature at saddle points.

03

Confirms conjecture on the landscape of unconstrained feature models.

Abstract

Recently, interesting empirical phenomena known as Neural Collapse have been observed during the final phase of training deep neural networks for classification tasks. We examine this issue when the feature dimension d is equal to the number of classes K. We demonstrate that two popular unconstrained feature models are strict saddle functions, with every critical point being either a global minimum or a strict saddle point that can be exited using negative curvatures. The primary findings conclusively confirm the conjecture on the unconstrained feature models in previous articles.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis · Computational Geometry and Mesh Generation