Sampling Based On Natural Image Statistics Improves Local Surrogate Explainers
Ricardo Kleinlein, Alexander Hepburn, Ra\'ul Santos-Rodr\'iguez and, Fernando Fern\'andez-Mart\'inez

TL;DR
This paper proposes new sampling methods for local surrogate explainers in computer vision that incorporate natural image statistics, leading to more accurate and meaningful explanations of deep neural network predictions.
Contribution
It introduces two approaches to align local sampling with natural image data distribution, improving surrogate explanations without needing access to the original training data.
Findings
Sampling based on natural image statistics enhances explanation quality.
Perceptual metrics improve the relevance of local neighborhoods.
Methods outperform traditional sampling in local surrogate explanations.
Abstract
Many problems in computer vision have recently been tackled using models whose predictions cannot be easily interpreted, most commonly deep neural networks. Surrogate explainers are a popular post-hoc interpretability method to further understand how a model arrives at a particular prediction. By training a simple, more interpretable model to locally approximate the decision boundary of a non-interpretable system, we can estimate the relative importance of the input features on the prediction. Focusing on images, surrogate explainers, e.g., LIME, generate a local neighbourhood around a query image by sampling in an interpretable domain. However, these interpretable domains have traditionally been derived exclusively from the intrinsic features of the query image, not taking into consideration the manifold of the data the non-interpretable model has been exposed to in training (or more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
MethodsLocal Interpretable Model-Agnostic Explanations
