Finite-Dimensional Gaussian Approximation for Deep Neural Networks: Universality in Random Weights

Krishnakumar Balasubramanian; Nathan Ross

arXiv:2507.12686·stat.ML·March 5, 2026

Finite-Dimensional Gaussian Approximation for Deep Neural Networks: Universality in Random Weights

Krishnakumar Balasubramanian, Nathan Ross

PDF

Open Access

TL;DR

This paper proves that deep neural networks with random weights and finite moments can be approximated by Gaussian distributions, with convergence rates depending on layer widths and depth, under certain conditions.

Contribution

It establishes Gaussian approximation bounds for neural network distributions with finite moments, extending understanding of their universality properties.

Findings

01

Gaussian approximation bounds in Wasserstein-1 norm

02

Convergence rates depend on layer widths and depth

03

Applicable to networks with Lipschitz activation functions

Abstract

We study the Finite-Dimensional Distributions (FDDs) of deep neural networks with randomly initialized weights that have finite-order moments. Specifically, we establish Gaussian approximation bounds in the Wasserstein- $1$ norm between the FDDs and their Gaussian limit assuming a Lipschitz activation function and allowing the layer widths to grow to infinity at arbitrary relative rates. In the special case where all widths are proportional to a common scale parameter $n$ and there are $L - 1$ hidden layers, we obtain convergence rates of order $n^{- (1 / 6)^{L - 1} + ϵ}$ , for any $ϵ > 0$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Tensor decomposition and applications