Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)

Yoonsoo Nam; Seok Hyeong Lee; Clementine C J Domine; Yeachan Park; Charles London; Wonyl Choi; Niclas Goring; Seungjai Lee

arXiv:2502.21009·stat.ML·May 27, 2025

Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)

Yoonsoo Nam, Seok Hyeong Lee, Clementine C J Domine, Yeachan Park, Charles London, Wonyl Choi, Niclas Goring, Seungjai Lee

PDF

1 Video

TL;DR

This paper advocates for using layerwise linear models to better understand complex neural phenomena like neural collapse, emergence, and grokking, aiming to accelerate deep learning research.

Contribution

It introduces the dynamical feedback principle in layerwise linear models as a unifying framework for neural phenomena, emphasizing their role in understanding deep neural network dynamics.

Findings

01

Layerwise linear models explain neural collapse and emergence.

02

Dynamical feedback principle unifies various neural phenomena.

03

Using simplified models accelerates deep learning understanding.

Abstract

In physics, complex systems are often simplified into minimal, solvable models that retain only the core principles. In machine learning, layerwise linear models (e.g., linear neural networks) act as simplified representations of neural network dynamics. These models follow the dynamical feedback principle, which describes how layers mutually govern and amplify each other's evolution. This principle extends beyond the simplified models, successfully explaining a wide range of dynamical phenomena in deep neural networks, including neural collapse, emergence, lazy and rich regimes, and grokking. In this position paper, we call for the use of layerwise linear models retaining the core principles of neural dynamical phenomena to accelerate the science of deep learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)· slideslive