Deep Stable neural networks: large-width asymptotics and convergence rates
Stefano Favaro, Sandra Fortini, Stefano Peluchetti

TL;DR
This paper rigorously analyzes the asymptotic behavior of deep Stable neural networks with stable-distributed weights as their width grows infinitely, establishing convergence to Stable stochastic processes and comparing growth regimes.
Contribution
It extends large-width asymptotic analysis from Gaussian to Stable neural networks, providing the first convergence rates in the joint growth setting.
Findings
Deep Stable NNs converge to Stable stochastic processes as width increases.
Convergence rates differ between joint and sequential growth regimes.
The joint growth leads to slower convergence rates depending on network depth.
Abstract
In modern deep learning, there is a recent and growing literature on the interplay between large-width asymptotic properties of deep Gaussian neural networks (NNs), i.e. deep NNs with Gaussian-distributed weights, and Gaussian stochastic processes (SPs). Such an interplay has proved to be critical in Bayesian inference under Gaussian SP priors, kernel regression for infinitely wide deep NNs trained via gradient descent, and information propagation within infinitely wide NNs. Motivated by empirical analyses that show the potential of replacing Gaussian distributions with Stable distributions for the NN's weights, in this paper we present a rigorous analysis of the large-width asymptotic behaviour of (fully connected) feed-forward deep Stable NNs, i.e. deep NNs with Stable-distributed weights. We show that as the width goes to infinity jointly over the NN's layers, i.e. the ``joint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
