Explainable Artificial Intelligence: Understanding, Visualizing and   Interpreting Deep Learning Models

Wojciech Samek; Thomas Wiegand; Klaus-Robert M\"uller

arXiv:1708.08296·cs.AI·August 29, 2017·1.0k cites

Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

Wojciech Samek, Thomas Wiegand, Klaus-Robert M\"uller

PDF

Open Access

TL;DR

This paper reviews recent advances in explainable AI, emphasizing the importance of interpretability in deep learning models, and introduces two methods for explaining model predictions evaluated on classification tasks.

Contribution

It summarizes recent developments in explainable AI and presents two novel approaches for interpreting deep learning predictions, emphasizing their importance for transparency.

Findings

01

Sensitivity analysis reveals input influence on predictions

02

Input variable decomposition clarifies decision-making process

03

Methods are validated on three classification tasks

Abstract

With the availability of large databases and recent improvements in deep learning methodology, the performance of AI systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples of this development can be found in domains such as image classification, sentiment analysis, speech understanding or strategic game playing. However, because of their nested non-linear structure, these highly successful machine learning and artificial intelligence models are usually applied in a black box manner, i.e., no information is provided about what exactly makes them arrive at their predictions. Since this lack of transparency can be a major drawback, e.g., in medical applications, the development of methods for visualizing, explaining and interpreting deep learning models has recently attracted increasing attention. This paper summarizes recent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning

MethodsInterpretability