Overpruning in Variational Bayesian Neural Networks

Brian Trippe; Richard Turner

arXiv:1801.06230·stat.ML·January 22, 2018·25 cites

Overpruning in Variational Bayesian Neural Networks

Brian Trippe, Richard Turner

PDF

Open Access

TL;DR

This paper investigates the counter-intuitive phenomenon where more expressive variational approximations in Bayesian neural networks can lead to worse predictions, identifying over-pruning as a key cause and providing a theoretical explanation.

Contribution

It identifies variational over-pruning as a cause of performance degradation and offers a theoretical framework to understand this phenomenon in Bayesian neural networks.

Findings

01

More expressive variational families can worsen predictions due to over-pruning.

02

The paper provides a theoretical explanation for over-pruning in variational Bayesian neural networks.

03

Insights help guide the design of better variational approximations.

Abstract

The motivations for using variational inference (VI) in neural networks differ significantly from those in latent variable models. This has a counter-intuitive consequence; more expressive variational approximations can provide significantly worse predictions as compared to those with less expressive families. In this work we make two contributions. First, we identify a cause of this performance gap, variational over-pruning. Second, we introduce a theoretically grounded explanation for this phenomenon. Our perspective sheds light on several related published results and provides intuition into the design of effective variational approximations of neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications