Explaining NonLinear Classification Decisions with Deep Taylor   Decomposition

Gr\'egoire Montavon; Sebastian Bach; Alexander Binder; Wojciech Samek,; Klaus-Robert M\"uller

arXiv:1512.02479·cs.LG·November 15, 2017

Explaining NonLinear Classification Decisions with Deep Taylor Decomposition

Gr\'egoire Montavon, Sebastian Bach, Alexander Binder, Wojciech Samek,, Klaus-Robert M\"uller

PDF

4 Repos

TL;DR

This paper introduces a novel method called Deep Taylor Decomposition for interpreting nonlinear deep neural network decisions, enhancing transparency by attributing classification outcomes to input features, applicable across various data types and architectures.

Contribution

It presents a new explanation technique for neural networks based on deep Taylor decomposition, improving interpretability of complex models.

Findings

01

Effective at explaining image classification decisions

02

Applicable to various neural network architectures

03

Demonstrated on MNIST and ILSVRC datasets

Abstract

Nonlinear methods such as Deep Neural Networks (DNNs) are the gold standard for various challenging machine learning problems, e.g., image classification, natural language processing or human action recognition. Although these methods perform impressively well, they have a significant disadvantage, the lack of transparency, limiting the interpretability of the solution and thus the scope of application in practice. Especially DNNs act as black boxes due to their multilayer nonlinear structure. In this paper we introduce a novel methodology for interpreting generic multilayer neural networks by decomposing the network classification decision into contributions of its input elements. Although our focus is on image classification, the method is applicable to a broad set of input data, learning tasks and network architectures. Our method is based on deep Taylor decomposition and efficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsInterpretability