Symmetry & Critical Points

Yossi Arjevani

arXiv:2408.14445 · cs.LG · August 27, 2024

TL;DR

This paper studies critical points of invariant functions. It proves that when a symmetric critical point exists, the critical points adjacent to it are generically symmetry breaking, which bears on how efficiently invariant nonconvex functions, such as those associated with neural networks, can be minimized.

Contribution

The paper identifies a mathematical mechanism: critical points adjacent to a symmetric critical point are generically symmetry breaking. It then shows that this mechanism affects how efficiently invariant nonconvex functions can be minimized.

Findings

01 Critical points adjacent to a symmetric critical point are generically symmetry breaking.

02 This mechanism has implications for the efficient minimization of invariant nonconvex functions, in particular those associated with neural networks.

03 The result provides a theoretical foundation for understanding how symmetry shapes critical points.

Abstract

Critical points of an invariant function may or may not be symmetric. We prove, however, that if a symmetric critical point exists, those adjacent to it are generically symmetry breaking. This mathematical mechanism is shown to carry important implications for our ability to efficiently minimize invariant nonconvex functions, in particular those associated with neural networks.
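
The first sentence of the abstract can be illustrated with a toy example. The sketch below is not taken from the paper: it assumes a hypothetical function f(x, y) = (x^2 - 1)^2 + (y^2 - 1)^2, which is invariant under swapping x and y, and lists its critical points to show that some are symmetric (x = y) and others are not. It does not demonstrate the paper's genericity result about adjacent critical points.

```python
from itertools import product

# Hypothetical toy function (an assumption for illustration, not from the paper).
# It is invariant under the coordinate swap (x, y) -> (y, x).
def f(x, y):
    return (x**2 - 1) ** 2 + (y**2 - 1) ** 2

def grad(x, y):
    # Partial derivatives of f with respect to x and y.
    return (4 * x * (x**2 - 1), 4 * y * (y**2 - 1))

# Each partial derivative vanishes exactly when its coordinate is -1, 0, or 1,
# so every critical point lies on this 3 x 3 grid.
candidates = list(product((-1, 0, 1), repeat=2))
critical = [p for p in candidates if grad(*p) == (0, 0)]

# A critical point is symmetric here if the swap leaves it fixed, i.e. x == y.
symmetric = [p for p in critical if p[0] == p[1]]
breaking = [p for p in critical if p[0] != p[1]]

print("symmetric critical points:", symmetric)
print("symmetry-breaking critical points:", breaking)
```

In this example the swap maps each symmetry-breaking critical point to another critical point, so the non-symmetric ones come in pairs such as (1, -1) and (-1, 1).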

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Neural Networks and Applications · Advanced Optimization Algorithms Research