Singular Bayesian Neural Networks
Mame Diarra Toure, David A. Stephens

TL;DR
This paper introduces a low-rank parameterization for Bayesian neural networks that reduces parameter count and improves uncertainty estimation, out-of-distribution detection, and calibration.
Contribution
It proposes a singular Bayesian neural network approach using low-rank weights, deriving new generalization bounds and demonstrating empirical benefits over existing methods.
Findings
Achieves up to 33x fewer parameters than Deep Ensembles.
Improves out-of-distribution detection and calibration.
Maintains competitive predictive performance.
Abstract
Bayesian neural networks promise calibrated uncertainty but require parameters for standard mean-field Gaussian posteriors. We argue this cost is often unnecessary, particularly when weight matrices exhibit fast singular value decay. By parameterizing weights as with , , we induce a posterior that is \emph{singular} with respect to the Lebesgue measure, concentrating on the rank- manifold. This singularity captures structured weight correlations through shared latent factors, geometrically distinct from mean-field's independence assumption. We derive PAC-Bayes generalization bounds whose complexity term scales as instead of , and prove loss bounds that decompose the error into optimization and rank-induced bias using the Eckart-Young-Mirsky theorem. We further adapt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
