From Shallow Bayesian Neural Networks to Gaussian Processes: General Convergence, Identifiability and Scalable Inference
Gracielle Antunes de Ara\'ujo, Fl\'avio B. Gon\c{c}alves

TL;DR
This paper establishes a general convergence from shallow Bayesian neural networks to Gaussian processes, introduces a new covariance function, and develops scalable inference methods with competitive results on real datasets.
Contribution
It provides a relaxed convergence theory, a novel covariance function, and a scalable Nyström-based inference method for BNNs and GPs.
Findings
Convergence from BNNs to GPs under relaxed assumptions
A new covariance function with positive definiteness and identifiability
Scalable MAP inference with competitive predictive performance
Abstract
In this work, we study scaling limits of shallow Bayesian neural networks (BNNs) via their connection to Gaussian processes (GPs), with an emphasis on statistical modeling, identifiability, and scalable inference. We first establish a general convergence result from BNNs to GPs by relaxing assumptions used in prior formulations, and we compare alternative parameterizations of the limiting GP model. Building on this theory, we propose a new covariance function defined as a convex mixture of components induced by four widely used activation functions, and we characterize key properties including positive definiteness and both strict and practical identifiability under different input designs. For computation, we develop a scalable maximum a posterior (MAP) training and prediction procedure using a Nystr\"om approximation, and we show how the Nystr\"om rank and anchor selection control the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning
