Bayesian Neural Networks with Domain Knowledge Priors

Dylan Sam; Rattana Pukdee; Daniel P. Jeong; Yewon Byun; J. Zico Kolter

arXiv:2402.13410·cs.LG·February 22, 2024·2 cites

Bayesian Neural Networks with Domain Knowledge Priors

Dylan Sam, Rattana Pukdee, Daniel P. Jeong, Yewon Byun, J. Zico Kolter

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a framework for integrating domain knowledge into Bayesian neural network priors via loss-function-based constraints, enhancing predictive accuracy and interpretability.

Contribution

It proposes a novel method to incorporate diverse domain knowledge into BNN priors through variational inference, improving model performance and transferability.

Findings

01

BNNs with domain knowledge priors outperform standard priors.

02

The approach effectively incorporates fairness, physics, and healthcare knowledge.

03

Priors can be transferred across different model architectures.

Abstract

Bayesian neural networks (BNNs) have recently gained popularity due to their ability to quantify model uncertainty. However, specifying a prior for BNNs that captures relevant domain knowledge is often extremely challenging. In this work, we propose a framework for integrating general forms of domain knowledge (i.e., any knowledge that can be represented by a loss function) into a BNN prior through variational inference, while enabling computationally efficient posterior inference and sampling. Specifically, our approach results in a prior over neural network weights that assigns high probability mass to models that better align with our domain knowledge, leading to posterior samples that also exhibit this behavior. We show that BNNs using our proposed domain knowledge priors outperform those with standard priors (e.g., isotropic Gaussian, Gaussian process), successfully incorporating…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 5Confidence 4

Strengths

The manuscript tries to address the important topic of informative priors for efficiently modeling Bayesian inference to incorporate domain knowledge.

Weaknesses

- The theoretical justification for phi loss incorporating domain knowledge is not clear; this proposed formulation resembles an empirical Bayes setup where informative priors are learned from the data. - There is a large body of work on informative priors that considers techniques such as empirical Bayes and hierarchical Bayes to incorporate domain knowledge. - In the results section, it is unclear why the comparison of phi values against the selected datasets indicates the incorporation of dom

Reviewer 02Rating 5Confidence 4

Strengths

* The paper is well-motivated to specify informed priors in BNNs that reflects the relevant domain knowledge and mitigate undesirable biases. * The proposed framework enables the integration of generic forms of domain knowledge such as physics rules, fairness, healthcare knowledge into BNN prior, and also proposes strategy to transfer the priors to other models without the need to relearn a new prior every time.

Weaknesses

* One of the main objective of Bayesian neural network (BNN) is uncertainty quantification (UQ) in their predictions. However, this paper does not address the ability of BNNs to reliably quantify model uncertainty in the study and experimental evaluation. The experiments conducted in the study are limited to evaluating predictive accuracy and domain knowledge surrogate loss, neglecting the critical UQ aspect of BNNs. This raises concerns about the evaluation of BNNs with the proposed informed pr

Reviewer 03Rating 5Confidence 4

Strengths

- The paper is well-motivated, and the approach is clean. Incorporating domain knowledge into informative priors is an interesting direction for Bayesian deep learning. - The paper proposes four general domain knowledge losses that supplement the available training data in meaningful ways. - The paper demonstrates that the learned prior is better aligned with domain knowledge and often improves predictive performance compared to standard priors.

Weaknesses

- My main concern is that I'm not convinced that the best way to use the proposed domain knowledge losses is to learn a prior over weights upfront. As the paper states in page 4, we could also use these losses to regularize the training process. You could similarly train a BNN with a likelihood function that incorporates these losses. Could the authors explain why learning a prior is better than these alternatives? Section A.5 empirically compares this alternative somewhat. Still, I think a more

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsALIGN