"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro; Sameer Singh; Carlos Guestrin

arXiv:1602.04938·cs.LG·August 10, 2016·347 cites

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

PDF

Open Access 5 Repos 1 Video

TL;DR

This paper introduces LIME, a technique for explaining individual predictions of any classifier in an interpretable way, enhancing trust and understanding of machine learning models across different domains.

Contribution

The paper presents LIME, a novel local explanation method applicable to any classifier, and a submodular optimization approach for selecting representative explanations, improving interpretability and trust.

Findings

01

LIME provides faithful local explanations for text and image classifiers.

02

The explanation method improves trust and understanding among users.

03

Experiments show the effectiveness of explanations in various trust-related scenarios.

Abstract

Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a prediction, or when choosing whether to deploy a new model. Such understanding also provides insights into the model, which can be used to transform an untrustworthy model or prediction into a trustworthy one. In this work, we propose LIME, a novel explanation technique that explains the predictions of any classifier in an interpretable and faithful manner, by learning an interpretable model locally around the prediction. We also propose a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem. We demonstrate the flexibility of these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Stanford Seminar - How can you trust machine learning? Carlos Guestrin· youtube

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification

MethodsLocal Interpretable Model-Agnostic Explanations