An Equivariance Toolbox for Learning Dynamics
Yongyi Yang, Liu Ziyin

TL;DR
This paper introduces a comprehensive equivariance toolbox that extends classical symmetry analyses to second-order learning dynamics, providing new insights into the geometry of neural network loss landscapes and their relation to transformations.
Contribution
It develops a general framework for analyzing both first- and second-order constraints in neural learning dynamics, extending Noether-type analyses to broader transformations and discrete cases.
Findings
Unifies conservation laws and implicit biases under a single identity.
Predicts curvature properties and flat/sharp directions in loss landscapes.
Connects transformation structures to empirical optimization observations.
Abstract
Many theoretical results in deep learning can be traced to symmetry or equivariance of neural networks under parameter transformations. However, existing analyses are typically problem-specific and focus on first-order consequences such as conservation laws, while the implications for second-order structure remain less understood. We develop a general equivariance toolbox that yields coupled first- and second-order constraints on learning dynamics. The framework extends classical Noether-type analyses in three directions: from gradient constraints to Hessian constraints, from symmetry to general equivariance, and from continuous to discrete transformations. At the first order, our framework unifies conservation laws and implicit-bias relations as special cases of a single identity. At the second order, it provides structural predictions about curvature: which directions are flat or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Advanced Graph Neural Networks
