Symmetry Induces Structure and Constraint of Learning

Liu Ziyin

arXiv:2309.16932·cs.LG·June 4, 2024

Symmetry Induces Structure and Constraint of Learning

Liu Ziyin

PDF

Open Access

TL;DR

This paper reveals how symmetries in the loss function of neural networks influence learning behavior, leading to specific parameter constraints and explaining phenomena like sparsity and low rankness.

Contribution

It establishes a theoretical link between loss function symmetries and parameter constraints, providing insights into neural network behaviors and potential algorithmic applications.

Findings

01

Mirror-reflection symmetry induces parameter constraints.

02

Rescaling symmetry leads to sparsity in models.

03

Rotation symmetry results in low-rankness.

Abstract

Due to common architecture designs, symmetries exist extensively in contemporary neural networks. In this work, we unveil the importance of the loss function symmetries in affecting, if not deciding, the learning behavior of machine learning models. We prove that every mirror-reflection symmetry, with reflection surface $O$ , in the loss function leads to the emergence of a constraint on the model parameters $θ$ : $O^{T} θ = 0$ . This constrained solution becomes satisfied when either the weight decay or gradient noise is large. Common instances of mirror symmetries in deep learning include rescaling, rotation, and permutation symmetry. As direct corollaries, we show that rescaling symmetry leads to sparsity, rotation symmetry leads to low rankness, and permutation symmetry leads to homogeneous ensembling. Then, we show that the theoretical framework can explain intriguing phenomena,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning in Materials Science · Medical Image Segmentation Techniques

MethodsWeight Decay