Structured Partial Stochasticity in Bayesian Neural Networks

Tommy Rochussen

arXiv:2405.17666·stat.ML·July 3, 2024

Structured Partial Stochasticity in Bayesian Neural Networks

Tommy Rochussen

PDF

Open Access

TL;DR

This paper introduces a structured approach to select deterministic weights in Bayesian neural networks, reducing redundant modes and improving the efficiency and performance of approximate inference methods.

Contribution

It proposes a novel structured method to eliminate neuron permutation symmetries, simplifying the posterior distribution in Bayesian neural networks.

Findings

01

Simplified posterior improves inference performance

02

Reduces redundant modes in Bayesian neural network posteriors

03

Enhances efficiency of approximate inference methods

Abstract

Bayesian neural network posterior distributions have a great number of modes that correspond to the same network function. The abundance of such modes can make it difficult for approximate inference methods to do their job. Recent work has demonstrated the benefits of partial stochasticity for approximate inference in Bayesian neural networks; inference can be less costly and performance can sometimes be improved. I propose a structured way to select the deterministic subset of weights that removes neuron permutation symmetries, and therefore the corresponding redundant posterior modes. With a drastically simplified posterior distribution, the performance of existing approximate inference schemes is found to be greatly improved.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications