Statistical Mechanics of Deep Linear Neural Networks: The Back-Propagating Kernel Renormalization
Qianyi Li, Haim Sompolinsky

TL;DR
This paper develops an exact statistical mechanics framework for deep linear neural networks, introducing the Back-Propagating Kernel Renormalization method to analyze network properties and extending it to ReLU networks, revealing insights into learning and generalization.
Contribution
It introduces the BPKR method for exact analysis of deep linear networks and extends it to ReLU networks, providing new theoretical insights into deep learning.
Findings
BPKR enables exact evaluation of generalization error and network properties.
ReLU networks share many properties with linear networks in a wide parameter regime.
The theory offers a new perspective on learning dynamics and representation across layers.
Abstract
The success of deep learning in many real-world tasks has triggered an intense effort to understand the power and limitations of deep learning in the training and generalization of complex tasks, so far with limited progress. In this work, we study the statistical mechanics of learning in Deep Linear Neural Networks (DLNNs) in which the input-output function of an individual unit is linear. Despite the linearity of the units, learning in DLNNs is nonlinear, hence studying its properties reveals some of the features of nonlinear Deep Neural Networks (DNNs). Importantly, we solve exactly the network properties following supervised learning using an equilibrium Gibbs distribution in the weight space. To do this, we introduce the Back-Propagating Kernel Renormalization (BPKR), which allows for the incremental integration of the network weights starting from the network output layer and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia?
