Understanding Priors in Bayesian Neural Networks at the Unit Level

Mariia Vladimirova; Jakob Verbeek; Pablo Mesejo; Julyan Arbel

arXiv:1810.05193·stat.ML·May 13, 2019·37 cites

Understanding Priors in Bayesian Neural Networks at the Unit Level

Mariia Vladimirova, Jakob Verbeek, Pablo Mesejo, Julyan Arbel

PDF

Open Access

TL;DR

This paper explores how Gaussian priors in deep Bayesian neural networks influence the distribution of unit activations, revealing increasingly heavy-tailed behaviors with depth, and providing new theoretical insights supported by simulations.

Contribution

It characterizes the evolving distributional properties of unit activations in deep Bayesian neural networks with Gaussian priors, revealing a progression from Gaussian to sub-Weibull distributions.

Findings

01

First layer units are Gaussian.

02

Second layer units are sub-exponential.

03

Deeper layer units are sub-Weibull.

Abstract

We investigate deep Bayesian neural networks with Gaussian weight priors and a class of ReLU-like nonlinearities. Bayesian neural networks with Gaussian priors are well known to induce an L2, "weight decay", regularization. Our results characterize a more intricate regularization effect at the level of the unit activations. Our main result establishes that the induced prior distribution on the units before and after activation becomes increasingly heavy-tailed with the depth of the layer. We show that first layer units are Gaussian, second layer units are sub-exponential, and units in deeper layers are characterized by sub-Weibull distributions. Our results provide new theoretical insight on deep Bayesian neural networks, which we corroborate with simulation experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications