Techniques for Interpretable Machine Learning

Mengnan Du; Ninghao Liu; Xia Hu

arXiv:1808.00033 · cs.LG · May 21, 2019 · 28 cites

Open Access

TL;DR

This survey reviews existing techniques for making machine learning models more interpretable, discusses current challenges, and highlights future research directions including user-friendly explanations and evaluation metrics.

Contribution

It provides a comprehensive overview of interpretability methods and identifies key issues for advancing the field of interpretable machine learning.

Findings

01 Summarizes various interpretability techniques

02 Highlights challenges in explanation design

03 Suggests future research directions

Abstract

Interpretable machine learning tackles the important problem that humans cannot understand the behaviors of complex machine learning models and how these models arrive at a particular decision. Although many approaches have been proposed, a comprehensive understanding of the achievements and challenges is still lacking. We provide a survey covering existing techniques to increase the interpretability of machine learning models. We also discuss crucial issues that the community should consider in future work such as designing user-friendly explanations and developing comprehensive evaluation metrics to further push forward the area of interpretable machine learning.

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

Methods: Interpretability