Bayesian Low-Rank LeArning (Bella): A Practical Approach to Bayesian   Neural Networks

Bao Gia Doan; Afshar Shamsi; Xiao-Yu Guo; Arash Mohammadi; Hamid; Alinejad-Rokny; Dino Sejdinovic; Damien Teney; Damith C. Ranasinghe; Ehsan; Abbasnejad

arXiv:2407.20891·cs.LG·February 19, 2025

Bayesian Low-Rank LeArning (Bella): A Practical Approach to Bayesian Neural Networks

Bao Gia Doan, Afshar Shamsi, Xiao-Yu Guo, Arash Mohammadi, Hamid, Alinejad-Rokny, Dino Sejdinovic, Damien Teney, Damith C. Ranasinghe, Ehsan, Abbasnejad

PDF

Open Access 1 Repo

TL;DR

Bella introduces a low-rank perturbation framework that significantly reduces the computational complexity of Bayesian neural networks, enabling scalable, practical, and often superior performance on large-scale tasks.

Contribution

The paper presents Bella, a novel low-rank perturbation approach that makes Bayesian neural networks more scalable and practical for large-scale applications.

Findings

01

Bella reduces the number of trainable parameters significantly.

02

It maintains or surpasses the performance of traditional Bayesian methods.

03

Effective on large-scale datasets like ImageNet and domain-specific tasks.

Abstract

Computational complexity of Bayesian learning is impeding its adoption in practical, large-scale tasks. Despite demonstrations of significant merits such as improved robustness and resilience to unseen or out-of-distribution inputs over their non- Bayesian counterparts, their practical use has faded to near insignificance. In this study, we introduce an innovative framework to mitigate the computational burden of Bayesian neural networks (BNNs). Our approach follows the principle of Bayesian techniques based on deep ensembles, but significantly reduces their cost via multiple low-rank perturbations of parameters arising from a pre-trained neural network. Both vanilla version of ensembles as well as more sophisticated schemes such as Bayesian learning with Stein Variational Gradient Descent (SVGD), previously deemed impractical for large models, can be seamlessly implemented within the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

afsharshamsi/bella
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Neural Networks and Applications · Fault Detection and Control Systems

MethodsContrastive Language-Image Pre-training