Incorporating functional summary information in Bayesian neural networks using a Dirichlet process likelihood approach
Vishnu Raj, Tianyu Cui, Markus Heinonen, Pekka Marttinen

TL;DR
This paper introduces a Bayesian neural network method that incorporates external summary information about classification probabilities using a Dirichlet process, improving accuracy and robustness with minimal computational cost.
Contribution
The authors propose a novel approach to embed prior knowledge into BNNs via a Dirichlet process likelihood, enhancing model performance and interpretability.
Findings
Improves accuracy and uncertainty calibration.
Enhances robustness against data corruptions.
Effective with both balanced and imbalanced datasets.
Abstract
Bayesian neural networks (BNNs) can account for both aleatoric and epistemic uncertainty. However, in BNNs the priors are often specified over the weights which rarely reflects true prior knowledge in large and complex neural network architectures. We present a simple approach to incorporate prior knowledge in BNNs based on external summary information about the predicted classification probabilities for a given dataset. The available summary information is incorporated as augmented data and modeled with a Dirichlet process, and we derive the corresponding \emph{Summary Evidence Lower BOund}. The approach is founded on Bayesian principles, and all hyperparameters have a proper probabilistic interpretation. We show how the method can inform the model about task difficulty and class imbalance. Extensive experiments show that, with negligible computational overhead, our method parallels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsSoftmax
