LIMIS: Locally Interpretable Modeling using Instance-wise Subsampling

Jinsung Yoon; Sercan O. Arik; Tomas Pfister

arXiv:1909.12367·cs.LG·September 23, 2022·1 cites

LIMIS: Locally Interpretable Modeling using Instance-wise Subsampling

Jinsung Yoon, Sercan O. Arik, Tomas Pfister

PDF

Open Access 1 Repo

TL;DR

LIMIS is a novel framework that enhances locally interpretable models' fidelity by instance-wise subsampling guided by policy gradients, achieving near black-box accuracy and outperforming existing methods.

Contribution

The paper introduces LIMIS, a new approach that uses policy gradient-based instance selection to improve the fidelity of locally interpretable models.

Findings

01

LIMIS achieves high fidelity close to black-box models.

02

LIMIS significantly outperforms existing locally interpretable models.

03

LIMIS maintains prediction accuracy comparable to black-box models.

Abstract

Understanding black-box machine learning models is crucial for their widespread adoption. Learning globally interpretable models is one approach, but achieving high performance with them is challenging. An alternative approach is to explain individual predictions using locally interpretable models. For locally interpretable modeling, various methods have been proposed and indeed commonly used, but they suffer from low fidelity, i.e. their explanations do not approximate the predictions well. In this paper, our goal is to push the state-of-the-art in high-fidelity locally interpretable modeling. We propose a novel framework, Locally Interpretable Modeling using Instance-wise Subsampling (LIMIS). LIMIS utilizes a policy gradient to select a small number of instances and distills the black-box model into a low-capacity locally interpretable model using those selected instances. Training is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/google-research
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning