Bayesian Uncertainty Estimation for Batch Normalized Deep Networks

Mattias Teye; Hossein Azizpour; Kevin Smith

arXiv:1802.06455·stat.ML·July 17, 2018·ICML·105 cites

Bayesian Uncertainty Estimation for Batch Normalized Deep Networks

Mattias Teye, Hossein Azizpour, Kevin Smith

PDF

Open Access 4 Repos

TL;DR

This paper reveals that batch normalization in deep networks can be viewed as approximate Bayesian inference, enabling uncertainty estimation without altering standard training procedures, validated through extensive empirical experiments.

Contribution

It establishes a theoretical link between batch normalization and Bayesian inference, allowing uncertainty estimation in conventional deep networks.

Findings

01

Outperforms baseline methods with statistical significance

02

Provides meaningful uncertainty estimates in various tasks

03

Achieves competitive performance with recent Bayesian approaches

Abstract

We show that training a deep network using batch normalization is equivalent to approximate inference in Bayesian models. We further demonstrate that this finding allows us to make meaningful estimates of the model uncertainty using conventional architectures, without modifications to the network or the training procedure. Our approach is thoroughly validated by measuring the quality of uncertainty in a series of empirical experiments on different tasks. It outperforms baselines with strong statistical significance, and displays competitive performance with recent Bayesian approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsBatch Normalization