Uncertainty Estimation via Stochastic Batch Normalization

Andrei Atanov; Arsenii Ashukha; Dmitry Molchanov; Kirill Neklyudov,; Dmitry Vetrov

arXiv:1802.04893·stat.ML·March 22, 2018

Uncertainty Estimation via Stochastic Batch Normalization

Andrei Atanov, Arsenii Ashukha, Dmitry Molchanov, Kirill Neklyudov,, Dmitry Vetrov

PDF

1 Repo

TL;DR

This paper introduces Stochastic Batch Normalization, a scalable method for uncertainty estimation in deep neural networks, based on a probabilistic interpretation of Batch Normalization, with demonstrated effectiveness on standard datasets.

Contribution

It provides a probabilistic interpretation of Batch Normalization and proposes Stochastic Batch Normalization as an efficient approximation for uncertainty estimation.

Findings

01

Effective uncertainty estimation on MNIST and CIFAR-10

02

Compatible with popular architectures like VGG and ResNets

03

Reduces computational cost compared to exact inference

Abstract

In this work, we investigate Batch Normalization technique and propose its probabilistic interpretation. We propose a probabilistic model and show that Batch Normalization maximazes the lower bound of its marginalized log-likelihood. Then, according to the new probabilistic model, we design an algorithm which acts consistently during train and test. However, inference becomes computationally inefficient. To reduce memory and computational cost, we propose Stochastic Batch Normalization -- an efficient approximation of proper inference procedure. This method provides us with a scalable uncertainty estimation technique. We demonstrate the performance of Stochastic Batch Normalization on popular architectures (including deep convolutional architectures: VGG-like and ResNets) for MNIST and CIFAR-10 datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Glutamat42/mcbn-pytorch
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBatch Normalization