Parameter Symmetry Potentially Unifies Deep Learning Theory

Liu Ziyin; Yizhou Xu; Tomaso Poggio; Isaac Chuang

arXiv:2502.05300·cs.LG·May 26, 2025

Parameter Symmetry Potentially Unifies Deep Learning Theory

Liu Ziyin, Yizhou Xu, Tomaso Poggio, Isaac Chuang

PDF

Open Access

TL;DR

This paper advocates for the study of parameter symmetries as a unifying principle to understand hierarchical learning phenomena in large AI systems, linking dynamics, complexity, and representations.

Contribution

It proposes that parameter symmetry breaking and restoration could unify existing fragmented theories of neural network behavior.

Findings

01

Parameter symmetry may unify learning dynamics, complexity, and representation hierarchies.

02

Symmetry principles from physics could be fundamental in AI theory.

03

Position paper synthesizes prior theories to support this unifying hypothesis.

Abstract

The dynamics of learning in modern large AI systems is hierarchical, often characterized by abrupt, qualitative shifts akin to phase transitions observed in physical systems. While these phenomena hold promise for uncovering the mechanisms behind neural networks and language models, existing theories remain fragmented, addressing specific cases. In this position paper, we advocate for the crucial role of the research direction of parameter symmetries in unifying these fragmented theories. This position is founded on a centralizing hypothesis for this direction: parameter symmetry breaking and restoration are the unifying mechanisms underlying the hierarchical learning behavior of AI models. We synthesize prior observations and theories to argue that this direction of research could lead to a unified understanding of three distinct hierarchies in neural networks: learning dynamics, model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications