Random Neural Networks in the Infinite Width Limit as Gaussian Processes

Boris Hanin

arXiv:2107.01562·math.PR·July 6, 2021

TL;DR

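This paper proves that fully connected neural networks with random weights and biases converge to Gaussian processes as the hidden layer widths tend to infinity, with the input dimension, output dimension, and depth held fixed. The result assumes only moment conditions on the weight distribution and covers quite general nonlinearities.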

Contribution

It gives a new proof of Gaussian process convergence for fully connected networks that, unlike prior work, assumes only moment conditions on the distribution of the weights and allows quite general nonlinearities.

Findings

01

Neural networks with random weights converge to Gaussian processes in the infinite width limit.

02

Convergence is shown assuming only moment conditions on the distribution of the weights.

03

The proof applies to a wide class of nonlinear activation functions.
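
To make the limit in these findings concrete, the block below writes out the network and the covariance recursion of the limiting Gaussian process in the conventions common in this literature. The symbols $C_W$, $C_b$, $n_\ell$, $\sigma$, and $K^{(\ell)}$ are assumptions, since this page does not fix any notation, and the paper's own statement may differ in detail. The sketch also assumes Gaussian biases and normalized weights with mean 0 and variance 1.

```latex
\begin{align*}
z^{(1)}(x) &= b^{(1)} + W^{(1)} x, \\
z^{(\ell+1)}(x) &= b^{(\ell+1)} + W^{(\ell+1)} \sigma\big(z^{(\ell)}(x)\big), \\
W^{(\ell)}_{ij} &= \sqrt{\frac{C_W}{n_{\ell-1}}}\, \widehat{W}^{(\ell)}_{ij},
\qquad \mathbb{E}\big[\widehat{W}^{(\ell)}_{ij}\big] = 0,
\quad \mathbb{E}\big[(\widehat{W}^{(\ell)}_{ij})^2\big] = 1,
\quad b^{(\ell)}_i \sim \mathcal{N}(0, C_b), \\
K^{(\ell+1)}(x, x') &= C_b + C_W\, \mathbb{E}\big[\sigma(z(x))\, \sigma(z(x'))\big],
\qquad
\begin{pmatrix} z(x) \\ z(x') \end{pmatrix}
\sim \mathcal{N}\!\left(0,
\begin{pmatrix} K^{(\ell)}(x, x) & K^{(\ell)}(x, x') \\ K^{(\ell)}(x, x') & K^{(\ell)}(x', x') \end{pmatrix}\right).
\end{align*}
```

Here $n_0$ is the input dimension and $n_1, \dots, n_L$ are the hidden widths that tend to infinity. One caveat on the first step: because $n_0$ stays fixed, the first-layer preactivations are Gaussian only when the first-layer weights are. For non-Gaussian weights, the expectation defining $K^{(2)}$ should be read as taken over the actual law of a first-layer neuron $z^{(1)}_1$ rather than over a Gaussian.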

Abstract

This article gives a new proof that fully connected neural networks with random weights and biases converge to Gaussian processes in the regime where the input dimension, output dimension, and depth are kept fixed, while the hidden layer widths tend to infinity. Unlike prior work, convergence is shown assuming only moment conditions for the distribution of weights and for quite general non-linearities.
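
The convergence described in the abstract can be checked numerically. The following is a minimal sketch, not the paper's method: it assumes a one-hidden-layer network with `tanh` nonlinearity, no biases, and Rademacher (±1) weights, chosen as an example of a non-Gaussian distribution with finite moments. The function names, the scaling parameter `c_w`, and the input vector are hypothetical choices for illustration. It draws many independent networks, evaluates each at one fixed input, and compares the excess kurtosis of the scalar output (which is 0 for a Gaussian) at width 1 and at width 200.

```python
import numpy as np


def sample_outputs(x, width, num_draws, rng, c_w=1.0):
    """Return the scalar output at input x of `num_draws` independent random networks."""
    n0 = x.shape[0]
    # Rademacher weights: mean 0, variance 1, all moments finite, but not Gaussian.
    w1 = rng.choice([-1.0, 1.0], size=(num_draws, width, n0))
    w2 = rng.choice([-1.0, 1.0], size=(num_draws, width))
    # Hidden layer: weights scaled by sqrt(c_w / fan_in); biases omitted for simplicity.
    hidden = np.tanh(np.sqrt(c_w / n0) * (w1 @ x))  # shape (num_draws, width)
    # Output layer: a normalized sum over the hidden neurons.
    return np.sqrt(c_w / width) * np.sum(w2 * hidden, axis=1)


def excess_kurtosis(samples):
    """Fourth standardized moment minus 3; equals 0 for a Gaussian distribution."""
    centered = samples - samples.mean()
    return np.mean(centered**4) / np.mean(centered**2) ** 2 - 3.0


rng = np.random.default_rng(0)
x = np.array([1.0, -0.5, 2.0])  # arbitrary fixed input with input dimension 3

kurt_narrow = excess_kurtosis(sample_outputs(x, width=1, num_draws=10_000, rng=rng))
kurt_wide = excess_kurtosis(sample_outputs(x, width=200, num_draws=10_000, rng=rng))

print(f"excess kurtosis, width 1:   {kurt_narrow:.3f}")
print(f"excess kurtosis, width 200: {kurt_wide:.3f}")
```

At width 1 the output takes only a handful of values and is far from Gaussian, while at width 200 its excess kurtosis should sit close to 0 up to sampling noise. A single scalar marginal is only a weak diagnostic. The statement in the abstract concerns convergence of the whole random field over inputs, which this sketch does not test.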
