Approximate Message Passing for Bayesian Neural Networks
Romeo Sommerfeld, Christian Helms, Ralf Herbrich

TL;DR
This paper introduces a novel message passing framework for Bayesian neural networks that effectively models the predictive posterior, handles convolutional architectures, and improves uncertainty calibration, addressing key limitations of existing methods.
Contribution
It presents the first message passing approach for convolutional neural networks that avoids double-counting data and enhances uncertainty quantification in Bayesian neural networks.
Findings
Competitive performance with SOTA baselines on CIFAR-10
Strong correlation (0.9) between credible intervals and true data-generating function
Scales to large MLPs with 5.6 million parameters
Abstract
Bayesian neural networks (BNNs) offer the potential for reliable uncertainty quantification and interpretability, which are critical for trustworthy AI in high-stakes domains. However, existing methods often struggle with issues such as overconfidence, hyperparameter sensitivity, and posterior collapse, leaving room for alternative approaches. In this work, we advance message passing (MP) for BNNs and present a novel framework that models the predictive posterior as a factor graph. To the best of our knowledge, our framework is the first MP method that handles convolutional neural networks and avoids double-counting training data, a limitation of previous MP methods that causes overconfidence. We evaluate our approach on CIFAR-10 with a convolutional neural network of roughly 890k parameters and find that it can compete with the SOTA baselines AdamW and IVON, even having an edge in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Fault Detection and Control Systems · Target Tracking and Data Fusion in Sensor Networks
MethodsVariational Inference · AdamW
