Defining Locality for Surrogates in Post-hoc Interpretablity

Thibault Laugel; Xavier Renard; Marie-Jeanne Lesot; Christophe; Marsala; Marcin Detyniecki

arXiv:1806.07498·cs.LG·June 21, 2018·46 cites

Defining Locality for Surrogates in Post-hoc Interpretablity

Thibault Laugel, Xavier Renard, Marie-Jeanne Lesot, Christophe, Marsala, Marcin Detyniecki

PDF

Open Access 1 Repo

TL;DR

This paper emphasizes the importance of defining the right locality in surrogate models for local explanations, proposing a boundary-centered sampling method to improve explanation accuracy and relevance.

Contribution

It introduces a novel approach that samples around the decision boundary rather than the prediction point, enhancing local surrogate explanations.

Findings

01

Boundary-centered sampling improves explanation fidelity.

02

The proposed method outperforms state-of-the-art approaches.

03

Evaluation on four UCI datasets confirms effectiveness.

Abstract

Local surrogate models, to approximate the local decision boundary of a black-box classifier, constitute one approach to generate explanations for the rationale behind an individual prediction made by the back-box. This paper highlights the importance of defining the right locality, the neighborhood on which a local surrogate is trained, in order to approximate accurately the local black-box decision boundary. Unfortunately, as shown in this paper, this issue is not only a parameter or sampling distribution challenge and has a major impact on the relevance and quality of the approximation of the local black-box decision boundary and thus on the meaning and accuracy of the generated explanation. To overcome the identified problems, quantified with an adapted measure and procedure, we propose to generate surrogate-based explanations for individual predictions based on a sampling centered…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

axadil/locality-interpretable-surrogate
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification