Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shawn W. M. Li

TL;DR
This paper introduces a new theoretical framework based on adaptive symmetries to understand how deep neural networks self-organize into functional structures, revealing conditions for achieving zero training error.
Contribution
It proposes a statistical-mechanical model of DNNs using adaptive symmetries, extending physics concepts to analyze complex neural systems and their optimization processes.
Findings
Large DNNs have extensive adaptive symmetries reservoir.
When information capacity exceeds dataset complexity, DNNs self-organize with zero training error.
The model generalizes statistical physics to describe high-dimensional regularities in DNNs.
Abstract
An antithetical concept, adaptive symmetry, to conservative symmetry in physics is proposed to understand the deep neural networks (DNNs). It characterizes the invariance of variance, where a biotic system explores different pathways of evolution with equal probability in absence of feedback signals, and complex functional structure emerges from quantitative accumulation of adaptive-symmetries breaking in response to feedback signals. Theoretically and experimentally, we characterize the optimization process of a DNN system as an extended adaptive-symmetry-breaking process. One particular finding is that a hierarchically large DNN would have a large reservoir of adaptive symmetries, and when the information capacity of the reservoir exceeds the complexity of the dataset, the system could absorb all perturbations of the examples and self-organize into a functional structure of zero…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Neural dynamics and brain function · Machine Learning in Materials Science
