Model-Agnostic Interpretability of Machine Learning

Marco Tulio Ribeiro; Sameer Singh; Carlos Guestrin

arXiv:1606.05386·stat.ML·June 20, 2016·697 cites

Model-Agnostic Interpretability of Machine Learning

Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

PDF

Open Access

TL;DR

This paper advocates for model-agnostic interpretability methods that treat machine learning models as black boxes, offering flexible explanations applicable across various models and enhancing understanding, debugging, and user trust.

Contribution

It reviews the importance of model-agnostic explanations in machine learning and discusses the LIME approach as a recent solution to interpretability challenges.

Findings

01

Model-agnostic explanations improve understanding of black-box models.

02

LIME provides local, interpretable explanations for any classifier.

03

Such methods enhance debugging and user trust in machine learning systems.

Abstract

Understanding why machine learning models behave the way they do empowers both system designers and end-users in many ways: in model selection, feature engineering, in order to trust and act upon the predictions, and in more intuitive user interfaces. Thus, interpretability has become a vital concern in machine learning, and work in the area of interpretable models has found renewed interest. In some applications, such models are as accurate as non-interpretable ones, and thus are preferred for their transparency. Even when they are not accurate, they may still be preferred when interpretability is of paramount importance. However, restricting machine learning to interpretable models is often a severe limitation. In this paper we argue for explaining machine learning predictions using model-agnostic approaches. By treating the machine learning models as black-box functions, these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

MethodsInterpretability