Training Bayesian Neural Networks with Sparse Subspace Variational Inference
Junbo Li, Zichen Miao, Qiang Qiu, Ruqi Zhang

TL;DR
This paper introduces Sparse Subspace Variational Inference (SSVI), a novel fully sparse Bayesian neural network training framework that maintains high sparsity throughout, significantly reducing computational costs while preserving accuracy and uncertainty quantification.
Contribution
The paper presents SSVI, the first fully sparse BNN framework that optimizes sparsity during training via a novel basis selection strategy, improving efficiency and robustness.
Findings
Achieves 10-20x model compression with less than 3% performance loss.
Reduces training FLOPs by up to 20x compared to dense VI.
Enhances robustness to hyperparameters, sometimes surpassing dense BNNs.
Abstract
Bayesian neural networks (BNNs) offer uncertainty quantification but come with the downside of substantially increased training and inference costs. Sparse BNNs have been investigated for efficient inference, typically by either slowly introducing sparsity throughout the training or by post-training compression of dense BNNs. The dilemma of how to cut down massive training costs remains, particularly given the requirement to learn about the uncertainty. To solve this challenge, we introduce Sparse Subspace Variational Inference (SSVI), the first fully sparse BNN framework that maintains a consistently highly sparse Bayesian model throughout the training and inference phases. Starting from a randomly initialized low-dimensional sparse subspace, our approach alternately optimizes the sparse subspace basis selection and its associated parameters. While basis selection is characterized as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Anomaly Detection Techniques and Applications · Neural Networks and Applications
MethodsVariational Inference
