BayesNAM: Leveraging Inconsistency for Reliable Explanations
Hoki Kim, Jinseong Park, Yujin Choi, Seungyun Lee, and Jaewook Lee

TL;DR
This paper introduces BayesNAM, a Bayesian neural additive model that leverages explanation inconsistencies to identify data issues and improve interpretability in neural additive models.
Contribution
It presents a novel framework that uses Bayesian neural networks and feature dropout to harness explanation inconsistencies for more reliable AI explanations.
Findings
BayesNAM effectively reveals data problems like insufficient data.
Theoretical proof shows feature dropout captures model inconsistencies.
Experiments demonstrate improved explanation reliability.
Abstract
Neural additive model (NAM) is a recently proposed explainable artificial intelligence (XAI) method that utilizes neural network-based architectures. Given the advantages of neural networks, NAMs provide intuitive explanations for their predictions with high model performance. In this paper, we analyze a critical yet overlooked phenomenon: NAMs often produce inconsistent explanations, even when using the same architecture and dataset. Traditionally, such inconsistencies have been viewed as issues to be resolved. However, we argue instead that these inconsistencies can provide valuable explanations within the given data model. Through a simple theoretical framework, we demonstrate that these inconsistencies are not mere artifacts but emerge naturally in datasets with multiple important features. To effectively leverage this information, we introduce a novel framework, Bayesian Neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
MethodsDropout
