Parameter Symmetry Potentially Unifies Deep Learning Theory
Liu Ziyin, Yizhou Xu, Tomaso Poggio, Isaac Chuang

TL;DR
This paper advocates for the study of parameter symmetries as a unifying principle to understand hierarchical learning phenomena in large AI systems, linking dynamics, complexity, and representations.
Contribution
It proposes that parameter symmetry breaking and restoration could unify existing fragmented theories of neural network behavior.
Findings
Parameter symmetry may unify learning dynamics, complexity, and representation hierarchies.
Symmetry principles from physics could be fundamental in AI theory.
Position paper synthesizes prior theories to support this unifying hypothesis.
Abstract
The dynamics of learning in modern large AI systems is hierarchical, often characterized by abrupt, qualitative shifts akin to phase transitions observed in physical systems. While these phenomena hold promise for uncovering the mechanisms behind neural networks and language models, existing theories remain fragmented, addressing specific cases. In this position paper, we advocate for the crucial role of the research direction of parameter symmetries in unifying these fragmented theories. This position is founded on a centralizing hypothesis for this direction: parameter symmetry breaking and restoration are the unifying mechanisms underlying the hierarchical learning behavior of AI models. We synthesize prior observations and theories to argue that this direction of research could lead to a unified understanding of three distinct hierarchies in neural networks: learning dynamics, model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
