Probabilistic Backpropagation for Scalable Learning of Bayesian Neural Networks
Jos\'e Miguel Hern\'andez-Lobato, Ryan P. Adams

TL;DR
This paper introduces Probabilistic Backpropagation (PBP), a scalable Bayesian neural network training method that improves over traditional techniques by providing calibrated probabilistic predictions and efficient learning on large datasets.
Contribution
The paper presents PBP, a novel scalable Bayesian neural network learning algorithm that combines probabilistic forward passes with backpropagation, enabling efficient training and uncertainty estimation.
Findings
PBP is significantly faster than existing Bayesian methods.
PBP achieves competitive predictive performance.
PBP accurately estimates posterior weight variances.
Abstract
Large multilayer neural networks trained with backpropagation have recently achieved state-of-the-art results in a wide range of problems. However, using backprop for neural net learning still has some disadvantages, e.g., having to tune a large number of hyperparameters to the data, lack of calibrated probabilistic predictions, and a tendency to overfit the training data. In principle, the Bayesian approach to learning neural networks does not have these problems. However, existing Bayesian techniques lack scalability to large dataset and network sizes. In this work we present a novel scalable method for learning Bayesian neural networks, called probabilistic backpropagation (PBP). Similar to classical backpropagation, PBP works by computing a forward propagation of probabilities through the network and then doing a backward computation of gradients. A series of experiments on ten…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
