Finite Neural Networks as Mixtures of Gaussian Processes: From Provable Error Bounds to Prior Selection

Steven Adams; Andrea Patan\`e; Morteza Lahijanian; and Luca Laurenti

arXiv:2407.18707·cs.LG·March 24, 2026·1 cites

Finite Neural Networks as Mixtures of Gaussian Processes: From Provable Error Bounds to Prior Selection

Steven Adams, Andrea Patan\`e, Morteza Lahijanian, and Luca Laurenti

PDF

Open Access

TL;DR

This paper introduces a method to approximate finite neural networks with mixtures of Gaussian processes, providing error bounds and enabling better understanding and tuning of neural network behavior.

Contribution

The authors develop an algorithmic framework to approximate finite neural networks with mixtures of Gaussian processes, including error bounds and applications to prior selection.

Findings

01

The method provides an $oldsymbol{ extepsilon}$-close approximation at finite input points.

02

It enables tuning neural networks to mimic Gaussian process behavior.

03

Empirical results demonstrate effectiveness on regression and classification tasks.

Abstract

Infinitely wide or deep neural networks (NNs) with independent and identically distributed (i.i.d.) parameters have been shown to be equivalent to Gaussian processes. Because of the favorable properties of Gaussian processes, this equivalence is commonly employed to analyze neural networks and has led to various breakthroughs over the years. However, neural networks and Gaussian processes are equivalent only in the limit; in the finite case there are currently no methods available to approximate a trained neural network with a Gaussian model with bounds on the approximation error. In this work, we present an algorithmic framework to approximate a neural network of finite width and depth, and with not necessarily i.i.d. parameters, with a mixture of Gaussian processes with error bounds on the approximation error. In particular, we consider the Wasserstein distance to quantify the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSparse Evolutionary Training