"Why Should You Trust My Explanation?" Understanding Uncertainty in LIME   Explanations

Yujia Zhang; Kuangyan Song; Yiming Sun; Sarah Tan; Madeleine Udell

arXiv:1904.12991·cs.LG·June 5, 2019·69 cites

"Why Should You Trust My Explanation?" Understanding Uncertainty in LIME Explanations

Yujia Zhang, Kuangyan Song, Yiming Sun, Sarah Tan, Madeleine Udell

PDF

Open Access

TL;DR

This paper investigates the uncertainty inherent in LIME explanations, identifying sources of randomness and variation, and demonstrates how this uncertainty affects trust in model interpretations across different datasets.

Contribution

It reveals and analyzes the sources of uncertainty in LIME explanations, highlighting their impact on interpretability and trustworthiness of machine learning models.

Findings

01

Uncertainty arises from sampling randomness and data point variation.

02

Uncertainty exists even in highly accurate models.

03

Empirical analysis on synthetic and real datasets supports the findings.

Abstract

Methods for interpreting machine learning black-box models increase the outcomes' transparency and in turn generates insight into the reliability and fairness of the algorithms. However, the interpretations themselves could contain significant uncertainty that undermines the trust in the outcomes and raises concern about the model's reliability. Focusing on the method "Local Interpretable Model-agnostic Explanations" (LIME), we demonstrate the presence of two sources of uncertainty, namely the randomness in its sampling procedure and the variation of interpretation quality across different input data points. Such uncertainty is present even in models with high training and test accuracy. We apply LIME to synthetic data and two public data sets, text classification in 20 Newsgroup and recidivism risk-scoring in COMPAS, to support our argument.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Topic Modeling

MethodsLocal Interpretable Model-Agnostic Explanations