Finite Neural Networks as Mixtures of Gaussian Processes: From Provable Error Bounds to Prior Selection
Steven Adams, Andrea Patan\`e, Morteza Lahijanian, and Luca Laurenti

TL;DR
This paper introduces a method to approximate finite neural networks with mixtures of Gaussian processes, providing error bounds and enabling better understanding and tuning of neural network behavior.
Contribution
The authors develop an algorithmic framework to approximate finite neural networks with mixtures of Gaussian processes, including error bounds and applications to prior selection.
Findings
The method provides an $oldsymbol{ extepsilon}$-close approximation at finite input points.
It enables tuning neural networks to mimic Gaussian process behavior.
Empirical results demonstrate effectiveness on regression and classification tasks.
Abstract
Infinitely wide or deep neural networks (NNs) with independent and identically distributed (i.i.d.) parameters have been shown to be equivalent to Gaussian processes. Because of the favorable properties of Gaussian processes, this equivalence is commonly employed to analyze neural networks and has led to various breakthroughs over the years. However, neural networks and Gaussian processes are equivalent only in the limit; in the finite case there are currently no methods available to approximate a trained neural network with a Gaussian model with bounds on the approximation error. In this work, we present an algorithmic framework to approximate a neural network of finite width and depth, and with not necessarily i.i.d. parameters, with a mixture of Gaussian processes with error bounds on the approximation error. In particular, we consider the Wasserstein distance to quantify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsSparse Evolutionary Training
