Incorporating functional summary information in Bayesian neural networks   using a Dirichlet process likelihood approach

Vishnu Raj; Tianyu Cui; Markus Heinonen; Pekka Marttinen

arXiv:2207.01234·cs.LG·January 25, 2023

Incorporating functional summary information in Bayesian neural networks using a Dirichlet process likelihood approach

Vishnu Raj, Tianyu Cui, Markus Heinonen, Pekka Marttinen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Bayesian neural network method that incorporates external summary information about classification probabilities using a Dirichlet process, improving accuracy and robustness with minimal computational cost.

Contribution

The authors propose a novel approach to embed prior knowledge into BNNs via a Dirichlet process likelihood, enhancing model performance and interpretability.

Findings

01

Improves accuracy and uncertainty calibration.

02

Enhances robustness against data corruptions.

03

Effective with both balanced and imbalanced datasets.

Abstract

Bayesian neural networks (BNNs) can account for both aleatoric and epistemic uncertainty. However, in BNNs the priors are often specified over the weights which rarely reflects true prior knowledge in large and complex neural network architectures. We present a simple approach to incorporate prior knowledge in BNNs based on external summary information about the predicted classification probabilities for a given dataset. The available summary information is incorporated as augmented data and modeled with a Dirichlet process, and we derive the corresponding \emph{Summary Evidence Lower BOund}. The approach is founded on Bayesian principles, and all hyperparameters have a proper probabilistic interpretation. We show how the method can inform the model about task difficulty and class imbalance. Extensive experiments show that, with negligible computational overhead, our method parallels…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

v-i-s-h/summary-likelihood
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)

MethodsSoftmax