"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

TL;DR
This paper introduces LIME, a technique for explaining individual predictions of any classifier in an interpretable way, enhancing trust and understanding of machine learning models across different domains.
Contribution
The paper presents LIME, a novel local explanation method applicable to any classifier, and a submodular optimization approach for selecting representative explanations, improving interpretability and trust.
Findings
LIME provides faithful local explanations for text and image classifiers.
The explanation method improves trust and understanding among users.
Experiments show the effectiveness of explanations in various trust-related scenarios.
Abstract
Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a prediction, or when choosing whether to deploy a new model. Such understanding also provides insights into the model, which can be used to transform an untrustworthy model or prediction into a trustworthy one. In this work, we propose LIME, a novel explanation technique that explains the predictions of any classifier in an interpretable and faithful manner, by learning an interpretable model locally around the prediction. We also propose a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem. We demonstrate the flexibility of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Stanford Seminar - How can you trust machine learning? Carlos Guestrin· youtube
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
MethodsLocal Interpretable Model-Agnostic Explanations
