On Interpretability of Artificial Neural Networks: A Survey

Fenglei Fan; Jinjun Xiong; Mengzhou Li; and Ge Wang

arXiv:2001.02522·cs.LG·September 29, 2021·46 cites

On Interpretability of Artificial Neural Networks: A Survey

Fenglei Fan, Jinjun Xiong, Mengzhou Li, and Ge Wang

PDF

Open Access 1 Repo

TL;DR

This survey reviews recent research on understanding and interpreting deep neural networks, highlighting their importance for critical applications like medicine and discussing future research directions.

Contribution

It provides a comprehensive taxonomy of interpretability methods, summarizes applications in medicine, and explores future directions including fuzzy logic and brain science.

Findings

01

Systematic review of interpretability techniques

02

Applications of interpretability in medical diagnosis

03

Discussion of future research directions

Abstract

Deep learning as represented by the artificial deep neural networks (DNNs) has achieved great success in many important areas that deal with text, images, videos, graphs, and so on. However, the black-box nature of DNNs has become one of the primary obstacles for their wide acceptance in mission-critical applications such as medical diagnosis and therapy. Due to the huge potential of deep learning, interpreting neural networks has recently attracted much research attention. In this paper, based on our comprehensive taxonomy, we systematically review recent studies in understanding the mechanism of neural networks, describe applications of interpretability especially in medicine, and discuss future directions of interpretability research, such as in relation to fuzzy logic and brain science.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

FengleiFan/IndependentEvaluation
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare

MethodsInterpretability