TL;DR
This paper develops a theoretical framework connecting neural networks to quantum field theory, using Wilsonian effective field theory to analyze their behavior and likelihoods, and relates overparameterization to model simplicity.
Contribution
It introduces a novel approach linking neural networks with quantum field theory concepts, providing a new perspective on their likelihoods and overparameterization.
Findings
Neural networks in the asymptotic limit behave like Gaussian processes.
Non-Gaussian corrections correspond to particle interactions in field theory.
The formalism applies broadly to architectures approaching Gaussian processes asymptotically.
Abstract
We propose a theoretical understanding of neural networks in terms of Wilsonian effective field theory. The correspondence relies on the fact that many asymptotic neural networks are drawn from Gaussian processes, the analog of non-interacting field theories. Moving away from the asymptotic limit yields a non-Gaussian process and corresponds to turning on particle interactions, allowing for the computation of correlation functions of neural network outputs with Feynman diagrams. Minimal non-Gaussian process likelihoods are determined by the most relevant non-Gaussian terms, according to the flow in their coefficients induced by the Wilsonian renormalization group. This yields a direct connection between overparameterization and simplicity of neural network likelihoods. Whether the coefficients are constants or functions may be understood in terms of GP limit symmetries, as expected from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
