Uncertainty-Preserving QBNNs: Multi-Level Quantization of SVI-Based Bayesian Neural Networks for Image Classification
Hendrik Borras, Yong Wu, Bernhard Klein, Holger Fr\"oning

TL;DR
This paper presents a multi-level quantization framework for Bayesian Neural Networks that preserves uncertainty estimates while significantly reducing memory requirements, enabling efficient deployment on edge devices.
Contribution
It introduces a systematic quantization approach for SVI-based BNNs, including novel strategies like VPQ, SPQ, and JQ, with specialized techniques for variance and distribution preservation.
Findings
BNNs can be quantized to 4-bit precision with minimal accuracy loss.
Joint Quantization achieves up to 8x memory reduction at 4 bits.
Uncertainty estimates remain reliable after quantization.
Abstract
Bayesian Neural Networks (BNNs) provide principled uncertainty quantification but suffer from substantial computational and memory overhead compared to deterministic networks. While quantization techniques have successfully reduced resource requirements in standard deep learning models, their application to probabilistic models remains largely unexplored. We introduce a systematic multi-level quantization framework for Stochastic Variational Inference based BNNs that distinguishes between three quantization strategies: Variational Parameter Quantization (VPQ), Sampled Parameter Quantization (SPQ), and Joint Quantization (JQ). Our logarithmic quantization for variance parameters, and specialized activation functions to preserve the distributional structure are essential for calibrated uncertainty estimation. Through comprehensive experiments on Dirty-MNIST, we demonstrate that BNNs can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Advanced Memory and Neural Computing
