Singular Bayesian Neural Networks

Mame Diarra Toure; David A. Stephens

arXiv:2602.00387·stat.ML·May 5, 2026

Singular Bayesian Neural Networks

Mame Diarra Toure, David A. Stephens

PDF

TL;DR

This paper introduces a low-rank parameterization for Bayesian neural networks that reduces parameter count and improves uncertainty estimation, out-of-distribution detection, and calibration.

Contribution

It proposes a singular Bayesian neural network approach using low-rank weights, deriving new generalization bounds and demonstrating empirical benefits over existing methods.

Findings

01

Achieves up to 33x fewer parameters than Deep Ensembles.

02

Improves out-of-distribution detection and calibration.

03

Maintains competitive predictive performance.

Abstract

Bayesian neural networks promise calibrated uncertainty but require $O (mn)$ parameters for standard mean-field Gaussian posteriors. We argue this cost is often unnecessary, particularly when weight matrices exhibit fast singular value decay. By parameterizing weights as $W = A B^{⊤}$ with $A \in R^{m \times r}$ , $B \in R^{n \times r}$ , we induce a posterior that is \emph{singular} with respect to the Lebesgue measure, concentrating on the rank- $r$ manifold. This singularity captures structured weight correlations through shared latent factors, geometrically distinct from mean-field's independence assumption. We derive PAC-Bayes generalization bounds whose complexity term scales as $r (m + n)$ instead of $mn$ , and prove loss bounds that decompose the error into optimization and rank-induced bias using the Eckart-Young-Mirsky theorem. We further adapt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.