Scalable Bayesian neural networks by layer-wise input augmentation

Trung Trinh; Samuel Kaski; Markus Heinonen

arXiv:2010.13498·stat.ML·October 27, 2020·1 cites

Scalable Bayesian neural networks by layer-wise input augmentation

Trung Trinh, Samuel Kaski, Markus Heinonen

PDF

Open Access 1 Repo

TL;DR

This paper presents a scalable method for Bayesian neural networks that enhances uncertainty estimation by augmenting layer inputs with latent variables, achieving state-of-the-art results in large-scale image classification.

Contribution

It introduces implicit Bayesian neural networks using layer-wise input augmentation, simplifying inference and improving uncertainty quantification in deep learning.

Findings

01

Achieves state-of-the-art calibration and robustness.

02

Effective uncertainty characterization on large-scale tasks.

03

Scalable approach for Bayesian neural networks.

Abstract

We introduce implicit Bayesian neural networks, a simple and scalable approach for uncertainty representation in deep learning. Standard Bayesian approach to deep learning requires the impractical inference of the posterior distribution over millions of parameters. Instead, we propose to induce a distribution that captures the uncertainty over neural networks by augmenting each layer's inputs with latent variables. We present appropriate input distributions and demonstrate state-of-the-art performance in terms of calibration, robustness and uncertainty characterisation over large-scale, multi-million parameter image classification tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trungtrinh44/ibnn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Algorithms