Variational Inference Failures Under Model Symmetries: Permutation   Invariant Posteriors for Bayesian Neural Networks

Yoav Gelberg; Tycho F.A. van der Ouderaa; Mark van der Wilk; Yarin Gal

arXiv:2408.05496·cs.LG·August 13, 2024

Variational Inference Failures Under Model Symmetries: Permutation Invariant Posteriors for Bayesian Neural Networks

Yoav Gelberg, Tycho F.A. van der Ouderaa, Mark van der Wilk, Yarin Gal

PDF

Open Access

TL;DR

This paper investigates how permutation symmetries in neural network weights cause multimodal posteriors that challenge variational inference, and proposes a symmetrization method to improve posterior approximation and predictive performance.

Contribution

The authors introduce a symmetrization mechanism for permutation invariant variational posteriors, improving the fit to true posteriors in Bayesian neural networks with weight symmetries.

Findings

01

Symmetrized posteriors better fit true posteriors.

02

Improved predictive accuracy with symmetrization.

03

Higher ELBO achieved using the proposed method.

Abstract

Weight space symmetries in neural network architectures, such as permutation symmetries in MLPs, give rise to Bayesian neural network (BNN) posteriors with many equivalent modes. This multimodality poses a challenge for variational inference (VI) techniques, which typically rely on approximating the posterior with a unimodal distribution. In this work, we investigate the impact of weight space permutation symmetries on VI. We demonstrate, both theoretically and empirically, that these symmetries lead to biases in the approximate posterior, which degrade predictive performance and posterior fit if not explicitly accounted for. To mitigate this behavior, we leverage the symmetric structure of the posterior and devise a symmetrization mechanism for constructing permutation invariant variational posteriors. We show that the symmetrized distribution has a strictly better fit to the true…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference

MethodsVariational Inference