Quantitative CLTs in Deep Neural Networks
Stefano Favaro, Boris Hanin, Domenico Marinucci, Ivan Nourdin,, Giovanni Peccati

TL;DR
This paper provides quantitative bounds on how closely deep neural networks with Gaussian weights approximate Gaussian processes as width increases, with improved and optimal convergence rates.
Contribution
It introduces new, stronger bounds on the finite-width approximation of neural networks to Gaussian processes, valid at large but finite widths and fixed depths.
Findings
Bounds scale as n^{-mma} for the discrepancy between network and Gaussian process
Bounds are strictly stronger than previous results in the literature
In one-dimensional case, bounds are proven to be optimal with matching lower bounds
Abstract
We study the distribution of a fully connected neural network with random Gaussian weights and biases in which the hidden layer widths are proportional to a large constant . Under mild assumptions on the non-linearity, we obtain quantitative bounds on normal approximations valid at large but finite and any fixed network depth. Our theorems show both for the finite-dimensional distributions and the entire process, that the distance between a random fully connected network (and its derivatives) to the corresponding infinite width Gaussian process scales like for , with the exponent depending on the metric used to measure discrepancy. Our bounds are strictly stronger in terms of their dependence on network width than any previously available in the literature; in the one-dimensional case, we also prove that they are optimal, i.e., we establish matching lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Neural Networks and Applications
MethodsGaussian Process
