Investigating maximum likelihood based training of infinite mixtures for   uncertainty quantification

Sina D\"aubener; Asja Fischer

arXiv:2008.03209·cs.LG·October 6, 2020

Investigating maximum likelihood based training of infinite mixtures for uncertainty quantification

Sina D\"aubener, Asja Fischer

PDF

Open Access

TL;DR

This paper explores training infinite mixture models for neural network uncertainty quantification using maximum likelihood, showing improved robustness, uncertainty estimation, and out-of-distribution detection over traditional variational methods.

Contribution

It introduces a novel approach to train infinite mixture models with maximum likelihood, enhancing uncertainty quantification without variational inference.

Findings

01

Increased predictive variance improves uncertainty estimation.

02

Enhanced robustness against adversarial attacks.

03

Higher entropy on out-of-distribution data.

Abstract

Uncertainty quantification in neural networks gained a lot of attention in the past years. The most popular approaches, Bayesian neural networks (BNNs), Monte Carlo dropout, and deep ensembles have one thing in common: they are all based on some kind of mixture model. While the BNNs build infinite mixture models and are derived via variational inference, the latter two build finite mixtures trained with the maximum likelihood method. In this work we investigate the effect of training an infinite mixture distribution with the maximum likelihood method instead of variational inference. We find that the proposed objective leads to stochastic networks with an increased predictive variance, which improves uncertainty based identification of miss-classification and robustness against adversarial attacks in comparison to a standard BNN with equivalent network structure. The new model also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Gaussian Processes and Bayesian Inference

MethodsDeep Ensembles