Regularizing Explanations in Bayesian Convolutional Neural Networks
Yanzhe Bekkemoen, Helge Langseth

TL;DR
This paper introduces a novel explanation regularization technique compatible with Bayesian neural networks, enhancing their ability to focus on relevant features, improve predictive accuracy, and quantify uncertainty, especially when overfitting on spurious patterns.
Contribution
It proposes a new explanation regularization method that works with Bayesian inference, enabling better feature focus and uncertainty estimation in neural networks.
Findings
Improves predictive performance on datasets with spurious features.
Outperforms data augmentation methods in regularizing explanations.
Enhances model focus on relevant features while maintaining uncertainty quantification.
Abstract
Neural networks are powerful function approximators with tremendous potential in learning complex distributions. However, they are prone to overfitting on spurious patterns. Bayesian inference provides a principled way to regularize neural networks and give well-calibrated uncertainty estimates. It allows us to specify prior knowledge on weights. However, specifying domain knowledge via distributions over weights is infeasible. Furthermore, it is unable to correct models when they focus on spurious or irrelevant features. New methods within explainable artificial intelligence allow us to regularize explanations in the form of feature importance to add domain knowledge and correct the models' focus. Nevertheless, they are incompatible with Bayesian neural networks, as they require us to modify the loss function. We propose a new explanation regularization method that is compatible with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Bayesian Modeling and Causal Inference
