Dynamics of Finite Width Kernel and Prediction Fluctuations in Mean Field Neural Networks
Blake Bordelon, Cengiz Pehlevan

TL;DR
This paper investigates how finite width affects the dynamics and fluctuations of neural networks during training, revealing how feature learning influences variance reduction and learning speed, with empirical validation on CNNs.
Contribution
It provides a non-perturbative analysis of finite width effects in neural networks, extending dynamical mean field theory to characterize prediction fluctuations and feature learning impacts.
Findings
Finite width causes O(1/√width) fluctuations in network predictions.
Feature learning reduces variance of kernels and predictions in two-layer networks.
Initialization variance can slow down online learning and affect dynamics in CNNs.
Abstract
We analyze the dynamics of finite width effects in wide but finite feature learning neural networks. Starting from a dynamical mean field theory description of infinite width deep neural network kernel and prediction dynamics, we provide a characterization of the fluctuations of the DMFT order parameters over random initializations of the network weights. Our results, while perturbative in width, unlike prior analyses, are non-perturbative in the strength of feature learning. In the lazy limit of network training, all kernels are random but static in time and the prediction variance has a universal form. However, in the rich, feature learning regime, the fluctuations of the kernels and predictions are dynamically coupled with a variance that can be computed self-consistently. In two layer networks, we show how feature learning can dynamically reduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques
