Random ReLU Neural Networks as Non-Gaussian Processes

Rahul Parhi; Pakshal Bohra; Ayoub El Biari; Mehrsa Pourya; Michael; Unser

arXiv:2405.10229·stat.ML·February 13, 2025

Random ReLU Neural Networks as Non-Gaussian Processes

Rahul Parhi, Pakshal Bohra, Ayoub El Biari, Mehrsa Pourya, Michael, Unser

PDF

Open Access

TL;DR

This paper demonstrates that shallow neural networks with ReLU activations and random parameters can be modeled as non-Gaussian stochastic processes, revealing new insights into their behavior beyond classical Gaussian limits.

Contribution

It introduces a non-asymptotic framework showing these networks form non-Gaussian processes driven by impulsive noise, extending classical Gaussian process convergence results.

Findings

01

Networks are well-defined non-Gaussian processes

02

Processes are isotropic and self-similar with Hurst exponent 3/2

03

Convergence to non-Gaussian processes under certain conditions

Abstract

We consider a large class of shallow neural networks with randomly initialized parameters and rectified linear unit activation functions. We prove that these random neural networks are well-defined non-Gaussian processes. As a by-product, we demonstrate that these networks are solutions to stochastic differential equations driven by impulsive white noise (combinations of random Dirac measures). These processes are parameterized by the law of the weights and biases as well as the density of activation thresholds in each bounded region of the input domain. We prove that these processes are isotropic and wide-sense self-similar with Hurst exponent 3/2. We also derive a remarkably simple closed-form expression for their autocovariance function. Our results are fundamentally different from prior work in that we consider a non-asymptotic viewpoint: The number of neurons in each bounded region…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference