Student-t processes as infinite-width limits of posterior Bayesian   neural networks

Francesco Caporali; Stefano Favaro; Dario Trevisan

arXiv:2502.04247·stat.ML·February 7, 2025

Student-t processes as infinite-width limits of posterior Bayesian neural networks

Francesco Caporali, Stefano Favaro, Dario Trevisan

PDF

Open Access

TL;DR

This paper demonstrates that Bayesian neural networks with Gaussian priors and inverse-gamma variance priors converge to Student-t processes in the infinite-width limit, providing a more flexible uncertainty model.

Contribution

It extends the understanding of BNN asymptotics by showing convergence to Student-t processes under specific prior assumptions.

Findings

01

Posterior BNNs approximate Student-t processes in the infinite-width limit.

02

The convergence rate is controlled using the Wasserstein metric.

03

Student-t processes offer greater flexibility than Gaussian processes for modeling uncertainty.

Abstract

The asymptotic properties of Bayesian Neural Networks (BNNs) have been extensively studied, particularly regarding their approximations by Gaussian processes in the infinite-width limit. We extend these results by showing that posterior BNNs can be approximated by Student-t processes, which offer greater flexibility in modeling uncertainty. Specifically, we show that, if the parameters of a BNN follow a Gaussian prior distribution, and the variance of both the last hidden layer and the Gaussian likelihood function follows an Inverse-Gamma prior distribution, then the resulting posterior BNN converges to a Student-t process in the infinite-width limit. Our proof leverages the Wasserstein metric to establish control over the convergence rate of the Student-t process approximation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications