Mean Field Limit of the Learning Dynamics of Multilayer Neural Networks

Phan-Minh Nguyen

arXiv:1902.02880 · cs.LG · February 11, 2019 · 36 cites

TL;DR

This paper shows that large multilayer neural networks, under suitable scalings and stochastic gradient descent, exhibit a limiting behavior that is independent of the number of neurons. The limit is described by a set of equations; the derivation is non-rigorous and is supported by experiments.

Contribution

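It introduces a formalism that captures the mean field limit of the learning dynamics of multilayer networks, exposing a previously unknown operating regime in which the behavior no longer depends on the number of neurons once that number is sufficiently large.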

Findings

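01 Under suitable scalings and stochastic gradient descent, behavior becomes independent of the number of neurons once that number is sufficiently large

02 A formalism, i.e. a set of equations derived non-rigorously, that describes the limiting behavior

03 Experiments that validate the existence of the limiting regime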

Abstract

Can multilayer neural networks -- typically constructed as highly complex structures with many nonlinearly activated neurons across layers -- behave in a non-trivial way that yet simplifies away a major part of their complexities? In this work, we uncover a phenomenon in which the behavior of these complex networks -- under suitable scalings and stochastic gradient descent dynamics -- becomes independent of the number of neurons as this number grows sufficiently large. We develop a formalism in which this many-neurons limiting behavior is captured by a set of equations, thereby exposing a previously unknown operating regime of these networks. While the current pursuit is mathematically non-rigorous, it is complemented with several experiments that validate the existence of this behavior.
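
As a rough illustration of what "independent of the number of neurons" can mean, the sketch below trains a much simpler model than the paper's: a two-layer network with the output averaged over its neurons, f(x) = (1/N) Σ a_i tanh(w_i · x), using one-sample SGD. This is an assumption made for brevity and is not the multilayer scaling developed in the paper. The teacher function, step size, widths, and all function names are hypothetical. Networks of different widths see the same data stream, and their predictions on held-out points are then compared.

```python
import numpy as np

def train_mean_field_net(width, steps=1000, lr=0.02, dim=5, seed=0):
    """Train f(x) = (1/width) * sum_i a_i * tanh(w_i . x) with one-sample SGD."""
    init_rng = np.random.default_rng(seed)    # initialisation differs per network
    data_rng = np.random.default_rng(12345)   # identical data stream for every width
    a = init_rng.normal(size=width)
    w = init_rng.normal(size=(width, dim)) / np.sqrt(dim)
    u = np.ones(dim) / np.sqrt(dim)           # hypothetical teacher direction
    for _ in range(steps):
        x = data_rng.normal(size=dim)
        y = np.tanh(u @ x)                    # hypothetical teacher label
        h = np.tanh(w @ x)                    # hidden activations, shape (width,)
        err = h @ a / width - y               # prediction error on this sample
        # The gradient of 0.5 * err**2 carries a 1/width factor. Multiplying the
        # step size by width cancels it, so each neuron moves at an O(1) rate.
        grad_a = err * h
        grad_w = (err * a * (1.0 - h**2))[:, None] * x[None, :]
        a -= lr * grad_a
        w -= lr * grad_w
    return a, w

def predict(params, xs):
    """Evaluate the averaged two-layer network on a batch of inputs."""
    a, w = params
    return np.tanh(xs @ w.T) @ a / a.size

test_rng = np.random.default_rng(999)
xs_test = test_rng.normal(size=(50, 5))
outputs = {n: predict(train_mean_field_net(n, seed=n), xs_test)
           for n in (50, 400, 3200)}
for n in (50, 400):
    gap = np.mean(np.abs(outputs[n] - outputs[3200]))
    print(f"width {n:5d} vs 3200: mean |difference| = {gap:.4f}")
```

If this toy model behaves like a mean field limit, the gap to the widest network should tend to shrink as the width grows, even though each network starts from a different random initialization. The multilayer case treated in the paper requires its own scalings, which this sketch does not attempt to reproduce.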

Code & Models

Repositories

npminh12/multilayer-mean-field
tf

Taxonomy

Topics: Neural Networks and Applications · Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks