On the Convergence of Locally Adaptive and Scalable Diffusion-Based   Sampling Methods for Deep Bayesian Neural Network Posteriors

Tim Rensmeyer; Oliver Niggemann

arXiv:2403.08609·cs.LG·March 15, 2024·1 cites

On the Convergence of Locally Adaptive and Scalable Diffusion-Based Sampling Methods for Deep Bayesian Neural Network Posteriors

Tim Rensmeyer, Oliver Niggemann

PDF

Open Access 1 Repo

TL;DR

This paper investigates the convergence properties of adaptive diffusion-based sampling methods for Bayesian neural networks, revealing potential biases even with small step sizes and full batch data, impacting uncertainty quantification.

Contribution

It critically analyzes existing adaptive sampling algorithms, demonstrating their potential bias and limitations in accurately sampling from neural network posteriors.

Findings

01

Existing methods can have substantial bias in the sampled distribution.

02

Bias persists even with vanishing step sizes and full batch data.

03

Challenges in achieving reliable uncertainty quantification in deep learning.

Abstract

Achieving robust uncertainty quantification for deep neural networks represents an important requirement in many real-world applications of deep learning such as medical imaging where it is necessary to assess the reliability of a neural network's prediction. Bayesian neural networks are a promising approach for modeling uncertainties in deep neural networks. Unfortunately, generating samples from the posterior distribution of neural networks is a major challenge. One significant advance in that direction would be the incorporation of adaptive step sizes, similar to modern neural network optimizers, into Monte Carlo Markov chain sampling algorithms without significantly increasing computational demand. Over the past years, several papers have introduced sampling algorithms with claims that they achieve this property. However, do they indeed converge to the correct distribution? In this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

timrensmeyer/Convergence-Experiments
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Seismic Imaging and Inversion Techniques · Image and Signal Denoising Methods