Precise Bayesian Neural Networks
Carlos Stein Brito

TL;DR
This paper introduces a normalization-based approach to Bayesian neural networks that models uncertainty in weight directions, leading to more stable, interpretable, and practical uncertainty estimation aligned with network geometry.
Contribution
It proposes a novel von Mises-Fisher posterior on the unit sphere, simplifying BNNs by focusing on weight directions and deriving a dimension-aware KL for improved calibration.
Findings
Improved calibration without sacrificing accuracy.
Stable optimization in high-dimensional settings.
Implementation-ready variational unit compatible with modern architectures.
Abstract
Despite its long history, Bayesian neural networks (BNNs) and variational training remain underused in practice: standard Gaussian posteriors misalign with network geometry, KL terms can be brittle in high dimensions, and implementations often add complexity without reliably improving uncertainty. We revisit the problem through the lens of normalization. Because normalization layers neutralize the influence of weight magnitude, we model uncertainty \emph{only in weight directions} using a von Mises-Fisher posterior on the unit sphere. High-dimensional geometry then yields a single, interpretable scalar per layer--the effective post-normalization noise --that (i) corresponds to simple additive Gaussian noise in the forward pass and (ii) admits a compact, dimension-aware KL in closed form. We derive accurate, closed-form approximations linking concentration …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
