Bayesian Low-Rank LeArning (Bella): A Practical Approach to Bayesian Neural Networks
Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo, Arash Mohammadi, Hamid, Alinejad-Rokny, Dino Sejdinovic, Damien Teney, Damith C. Ranasinghe, Ehsan, Abbasnejad

TL;DR
Bella introduces a low-rank perturbation framework that significantly reduces the computational complexity of Bayesian neural networks, enabling scalable, practical, and often superior performance on large-scale tasks.
Contribution
The paper presents Bella, a novel low-rank perturbation approach that makes Bayesian neural networks more scalable and practical for large-scale applications.
Findings
Bella reduces the number of trainable parameters significantly.
It maintains or surpasses the performance of traditional Bayesian methods.
Effective on large-scale datasets like ImageNet and domain-specific tasks.
Abstract
Computational complexity of Bayesian learning is impeding its adoption in practical, large-scale tasks. Despite demonstrations of significant merits such as improved robustness and resilience to unseen or out-of-distribution inputs over their non- Bayesian counterparts, their practical use has faded to near insignificance. In this study, we introduce an innovative framework to mitigate the computational burden of Bayesian neural networks (BNNs). Our approach follows the principle of Bayesian techniques based on deep ensembles, but significantly reduces their cost via multiple low-rank perturbations of parameters arising from a pre-trained neural network. Both vanilla version of ensembles as well as more sophisticated schemes such as Bayesian learning with Stein Variational Gradient Descent (SVGD), previously deemed impractical for large models, can be seamlessly implemented within the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Neural Networks and Applications · Fault Detection and Control Systems
MethodsContrastive Language-Image Pre-training
