Bayesian neural networks with interpretable priors from Mercer kernels
Alex Alberts, Ilias Bilionis

TL;DR
This paper introduces Mercer priors for Bayesian neural networks, enabling them to incorporate interpretable Gaussian process kernels, thus improving uncertainty quantification while maintaining scalability.
Contribution
It proposes a novel class of priors called Mercer priors that align BNNs with specified GPs without structural constraints, enhancing interpretability and scalability.
Findings
Mercer priors enable BNNs to approximate specified GPs.
The method improves uncertainty quantification in BNNs.
Scalable Bayesian inference is maintained with the new priors.
Abstract
Quantifying the uncertainty in the output of a neural network is essential for deployment in scientific or engineering applications where decisions must be made under limited or noisy data. Bayesian neural networks (BNNs) provide a framework for this purpose by constructing a Bayesian posterior distribution over the network parameters. However, the prior, which is of key importance in any Bayesian setting, is rarely meaningful for BNNs. This is because the complexity of the input-to-output map of a BNN makes it difficult to understand how certain distributions enforce any interpretable constraint on the output space of the network. Gaussian processes (GPs), on the other hand, are often preferred in uncertainty quantification tasks due to their interpretability. The drawback is that GPs are limited to small datasets without advanced techniques, which often rely on the covariance kernel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
