Asymptotics of Wide Networks from Feynman Diagrams

Ethan Dyer; Guy Gur-Ari

arXiv:1909.11304·cs.LG·September 26, 2019·45 cites

Asymptotics of Wide Networks from Feynman Diagrams

Ethan Dyer, Guy Gur-Ari

PDF

Open Access

TL;DR

This paper introduces a Feynman diagram-based method to analyze the asymptotic behavior of wide neural networks, providing new theoretical insights and empirical validations of training dynamics beyond the large width limit.

Contribution

It adapts Feynman diagrams for analyzing wide networks, deriving higher-order terms and improving bounds on training dynamics during stochastic gradient descent.

Findings

01

Improved bounds on training dynamics of wide networks.

02

Closed-form expressions for higher-order terms.

03

Empirical validation of theoretical predictions.

Abstract

Understanding the asymptotic behavior of wide networks is of considerable interest. In this work, we present a general method for analyzing this large width behavior. The method is an adaptation of Feynman diagrams, a standard tool for computing multivariate Gaussian integrals. We apply our method to study training dynamics, improving existing bounds and deriving new results on wide network evolution during stochastic gradient descent. Going beyond the strict large width limit, we present closed-form expressions for higher-order terms governing wide network training, and test these predictions empirically.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topicsadvanced mathematical theories · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods