Exploring Loss Landscapes through the Lens of Spin Glass Theory
Hao Liao, Wei Zhang, Zhanyi Huang, Zexiao Long, Mingyang Zhou, Xiaoqun, Wu, Rui Mao, Chi Ho Yeung

TL;DR
This paper applies spin glass theory to analyze the loss landscapes of deep neural networks, revealing insights into their structure, symmetry, and generalizability, and introducing protocols to study these complex landscapes.
Contribution
It introduces a novel perspective by modeling DNN loss landscapes using spin glass theory and develops protocols to analyze their structure and generalization properties.
Findings
Loss landscapes exhibit hierarchical structures similar to spin glass states.
Permutation symmetry leads to multiple equivalent minima in the loss landscape.
Flatter minima correlate with better generalization in DNNs.
Abstract
In the past decade, significant strides in deep learning have led to numerous groundbreaking applications. Despite these advancements, the understanding of the high generalizability of deep learning, especially in such an over-parametrized space, remains limited. For instance, in deep neural networks (DNNs), their internal representations, decision-making mechanism, absence of overfitting in an over-parametrized space, superior generalizability, etc., remain less understood. Successful applications are often considered as empirical rather than scientific achievement. This paper delves into the loss landscape of DNNs through the lens of spin glass in statistical physics, a system characterized by a complex energy landscape with numerous metastable states, as a novel perspective in understanding how DNNs work. We investigated the loss landscape of single hidden layer neural networks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTheoretical and Computational Physics
