The edge of chaos: quantum field theory and deep neural networks
Kevin T. Grosvenor, Ro Jefferson

TL;DR
This paper develops a quantum field theory framework for deep neural networks, analyzing criticality and fluctuations to understand information propagation and trainability.
Contribution
It constructs a QFT model for neural networks, deriving criticality conditions and loop corrections, linking neural dynamics to well-known physical models.
Findings
Derived the mean-field theory and criticality conditions for neural networks.
Computed loop corrections analogous to the $O(N)$ vector model.
Identified how finite-width effects influence information propagation.
Abstract
We explicitly construct the quantum field theory corresponding to a general class of deep neural networks encompassing both recurrent and feedforward architectures. We first consider the mean-field theory (MFT) obtained as the leading saddlepoint in the action, and derive the condition for criticality via the largest Lyapunov exponent. We then compute the loop corrections to the correlation function in a perturbative expansion in the ratio of depth to width , and find a precise analogy with the well-studied vector model, in which the variance of the weight initializations plays the role of the 't Hooft coupling. In particular, we compute both the corrections quantifying fluctuations from typicality in the ensemble of networks, and the subleading corrections due to finite-width effects. These provide corrections to the correlation length…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum many-body systems · Quantum and electron transport phenomena · Theoretical and Computational Physics
