Sampling Based On Natural Image Statistics Improves Local Surrogate   Explainers

Ricardo Kleinlein; Alexander Hepburn; Ra\'ul Santos-Rodr\'iguez and; Fernando Fern\'andez-Mart\'inez

arXiv:2208.03961·cs.CV·October 11, 2022·1 cites

Sampling Based On Natural Image Statistics Improves Local Surrogate Explainers

Ricardo Kleinlein, Alexander Hepburn, Ra\'ul Santos-Rodr\'iguez and, Fernando Fern\'andez-Mart\'inez

PDF

Open Access

TL;DR

This paper proposes new sampling methods for local surrogate explainers in computer vision that incorporate natural image statistics, leading to more accurate and meaningful explanations of deep neural network predictions.

Contribution

It introduces two approaches to align local sampling with natural image data distribution, improving surrogate explanations without needing access to the original training data.

Findings

01

Sampling based on natural image statistics enhances explanation quality.

02

Perceptual metrics improve the relevance of local neighborhoods.

03

Methods outperform traditional sampling in local surrogate explanations.

Abstract

Many problems in computer vision have recently been tackled using models whose predictions cannot be easily interpreted, most commonly deep neural networks. Surrogate explainers are a popular post-hoc interpretability method to further understand how a model arrives at a particular prediction. By training a simple, more interpretable model to locally approximate the decision boundary of a non-interpretable system, we can estimate the relative importance of the input features on the prediction. Focusing on images, surrogate explainers, e.g., LIME, generate a local neighbourhood around a query image by sampling in an interpretable domain. However, these interpretable domains have traditionally been derived exclusively from the intrinsic features of the query image, not taking into consideration the manifold of the data the non-interpretable model has been exposed to in training (or more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

MethodsLocal Interpretable Model-Agnostic Explanations