Analytic Mutual Information in Bayesian Neural Networks

Jae Oh Woo

arXiv:2201.09815·cs.IT·June 22, 2022·1 cites

Analytic Mutual Information in Bayesian Neural Networks

Jae Oh Woo

PDF

Open Access

TL;DR

This paper derives an analytical formula for mutual information in Bayesian neural networks, enhancing understanding of epistemic uncertainty and improving active learning strategies.

Contribution

It provides the first explicit analytical expression for mutual information in Bayesian neural networks, linking it to point process entropy and applying it to active learning.

Findings

01

Analytical formula for mutual information derived

02

Improved active learning performance demonstrated

03

Application to Dirichlet distribution parameter estimation

Abstract

Bayesian neural networks have successfully designed and optimized a robust neural network model in many application problems, including uncertainty quantification. However, with its recent success, information-theoretic understanding about the Bayesian neural network is still at an early stage. Mutual information is an example of an uncertainty measure in a Bayesian neural network to quantify epistemic uncertainty. Still, no analytic formula is known to describe it, one of the fundamental information measures to understand the Bayesian deep learning framework. In this paper, we derive the analytical formula of the mutual information between model parameters and the predictive output by leveraging the notion of the point process entropy. Then, as an application, we discuss the parameter estimation of the Dirichlet distribution and show its practical application in the active learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Fault Detection and Control Systems · Machine Learning in Materials Science