ALIME: Autoencoder Based Approach for Local Interpretability
Sharath M. Shankaranarayana, Davor Runje

TL;DR
This paper introduces ALIME, an autoencoder-enhanced local interpretability method that improves explanation stability and fidelity for deep learning models, especially in sensitive domains like medicine.
Contribution
The paper proposes a novel modification to LIME using autoencoders to enhance explanation stability and local fidelity in model interpretability.
Findings
ALIME outperforms LIME in stability of explanations.
ALIME achieves higher local fidelity across datasets.
Autoencoder integration improves interpretability reliability.
Abstract
Machine learning and especially deep learning have garneredtremendous popularity in recent years due to their increased performanceover other methods. The availability of large amount of data has aidedin the progress of deep learning. Nevertheless, deep learning models areopaque and often seen as black boxes. Thus, there is an inherent need tomake the models interpretable, especially so in the medical domain. Inthis work, we propose a locally interpretable method, which is inspiredby one of the recent tools that has gained a lot of interest, called localinterpretable model-agnostic explanations (LIME). LIME generates singleinstance level explanation by artificially generating a dataset aroundthe instance (by randomly sampling and using perturbations) and thentraining a local linear interpretable model. One of the major issues inLIME is the instability in the generated explanation, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLocal Interpretable Model-Agnostic Explanations
