Local Interpretable Model-agnostic Explanations of Bayesian Predictive Models via Kullback-Leibler Projections
Tomi Peltola

TL;DR
KL-LIME is a novel method that explains Bayesian model predictions locally by projecting complex predictive distributions onto simpler interpretable models, balancing fidelity and complexity.
Contribution
It combines LIME with Bayesian projection techniques to improve local explanations of Bayesian predictive models.
Findings
Effective explanation of Bayesian neural network predictions on MNIST.
Balances explanation fidelity and interpretability using information theory.
Demonstrates applicability to deep convolutional neural networks.
Abstract
We introduce a method, KL-LIME, for explaining predictions of Bayesian predictive models by projecting the information in the predictive distribution locally to a simpler, interpretable explanation model. The proposed approach combines the recent Local Interpretable Model-agnostic Explanations (LIME) method with ideas from Bayesian projection predictive variable selection methods. The information theoretic basis helps in navigating the trade-off between explanation fidelity and complexity. We demonstrate the method in explaining MNIST digit classifications made by a Bayesian deep convolutional neural network.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
