When Does Learning Renormalize? Sufficient Conditions for Power Law Spectral Dynamics
Yizhou Zhang

TL;DR
This paper establishes sufficient conditions under which spectral energy transport in deep learning exhibits power-law scaling, linking renormalizability and invariance properties to observed spectral dynamics.
Contribution
It introduces the GRSD framework and identifies structural conditions that lead to power-law spectral scaling in deep learning models.
Findings
Power-law scaling arises under specific structural conditions.
Renormalizability alone does not guarantee power-law behavior.
Log-shift invariance combined with gradient flow covariance enforces power-law scaling.
Abstract
Empirical power--law scaling has been widely observed across modern deep learning systems, yet its theoretical origins and scope of validity remain incompletely understood. The Generalized Resolution--Shell Dynamics (GRSD) framework models learning as spectral energy transport across logarithmic resolution shells, providing a coarse--grained dynamical description of training. Within GRSD, power--law scaling corresponds to a particularly simple renormalized shell dynamics; however, such behavior is not automatic and requires additional structural properties of the learning process. In this work, we identify a set of sufficient conditions under which the GRSD shell dynamics admits a renormalizable coarse--grained description. These conditions constrain the learning configuration at multiple levels, including boundedness of gradient propagation in the computation graph, weak functional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum many-body systems · Neural Networks and Reservoir Computing · Machine Learning in Materials Science
