ALIME: Autoencoder Based Approach for Local Interpretability

Sharath M. Shankaranarayana; Davor Runje

arXiv:1909.02437 · cs.LG · September 6, 2019

TL;DR

This paper introduces ALIME, an autoencoder-enhanced local interpretability method that improves explanation stability and fidelity for deep learning models, especially in sensitive domains like medicine.

Contribution

The paper proposes a novel modification to LIME using autoencoders to enhance explanation stability and local fidelity in model interpretability.

Findings

01 ALIME outperforms LIME in stability of explanations.

02 ALIME achieves higher local fidelity across datasets.

03 Autoencoder integration improves interpretability reliability.

Abstract

Machine learning and especially deep learning have garnered tremendous popularity in recent years due to their increased performance over other methods. The availability of large amount of data has aided in the progress of deep learning. Nevertheless, deep learning models are opaque and often seen as black boxes. Thus, there is an inherent need to make the models interpretable, especially so in the medical domain. In this work, we propose a locally interpretable method, which is inspired by one of the recent tools that has gained a lot of interest, called local interpretable model-agnostic explanations (LIME). LIME generates single instance level explanation by artificially generating a dataset around the instance (by randomly sampling and using perturbations) and then training a local linear interpretable model. One of the major issues in LIME is the instability in the generated explanation, which…
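The LIME procedure summarized in the abstract (perturb around one instance, query the black box, fit a local linear model) can be sketched roughly as follows. This is a minimal illustration, not the authors' code or the `lime` library: the function name `lime_style_explanation`, the Gaussian perturbations, the exponential distance kernel, and the ridge penalty are all assumptions chosen to keep the example short.

```python
import numpy as np


def lime_style_explanation(predict_fn, instance, num_samples=1000,
                           noise_scale=0.5, kernel_width=1.0,
                           ridge=1e-3, seed=0):
    """Fit a weighted linear surrogate around `instance` (hypothetical helper).

    predict_fn maps an (n, d) array to n scalar predictions.
    Returns (coefficients, intercept) of the local linear model.
    """
    rng = np.random.default_rng(seed)
    instance = np.asarray(instance, dtype=float)

    # 1. Artificially generate a dataset around the instance
    #    (assumption: isotropic Gaussian perturbations).
    noise = rng.standard_normal((num_samples, instance.size))
    samples = instance + noise_scale * noise

    # 2. Query the black-box model on the perturbed samples.
    targets = np.asarray(predict_fn(samples), dtype=float)

    # 3. Weight samples by proximity to the instance
    #    (assumption: exponential kernel on Euclidean distance).
    distances = np.linalg.norm(samples - instance, axis=1)
    weights = np.exp(-(distances ** 2) / (kernel_width ** 2))

    # 4. Solve weighted ridge regression in closed form. Features are
    #    centered on the instance, so the intercept approximates the
    #    surrogate's prediction at the instance itself. For brevity the
    #    small ridge penalty is also applied to the intercept.
    design = np.hstack([np.ones((num_samples, 1)), samples - instance])
    weighted_design = design * weights[:, None]
    gram = design.T @ weighted_design + ridge * np.eye(design.shape[1])
    coef = np.linalg.solve(gram, weighted_design.T @ targets)
    return coef[1:], coef[0]
```

Calling `lime_style_explanation(model_fn, x)` returns one coefficient per input feature, read as that feature's local importance. Because the perturbed dataset is drawn at random, rerunning with a different `seed` on a nonlinear model generally yields somewhat different coefficients.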

Taxonomy

Methods: Local Interpretable Model-Agnostic Explanations