A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference
Kumar Shridhar, Felix Laumann, Marcus Liwicki

TL;DR
This paper introduces Bayesian Convolutional Neural Networks using Variational Inference to model uncertainty, improve regularization, and eliminate dropout, demonstrating comparable performance to traditional methods across various image tasks.
Contribution
It proposes a novel Bayesian CNN architecture with variational inference, applying dual convolutional operations for mean and variance, and explores uncertainty quantification and model pruning.
Findings
Achieves performance comparable to point-estimate CNNs on multiple datasets.
Provides uncertainty measures that improve decision reliability.
Eliminates dropout, reducing model complexity and training time.
Abstract
Artificial Neural Networks are connectionist systems that perform a given task by learning on examples without having prior knowledge about the task. This is done by finding an optimal point estimate for the weights in every node. Generally, the network using point estimates as weights perform well with large datasets, but they fail to express uncertainty in regions with little or no data, leading to overconfident decisions. In this paper, Bayesian Convolutional Neural Network (BayesCNN) using Variational Inference is proposed, that introduces probability distribution over the weights. Furthermore, the proposed BayesCNN architecture is applied to tasks like Image Classification, Image Super-Resolution and Generative Adversarial Networks. The results are compared to point-estimates based architectures on MNIST, CIFAR-10 and CIFAR-100 datasets for Image CLassification task, on BSD300…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Anomaly Detection Techniques and Applications
MethodsDropout
