On Interpretability of Artificial Neural Networks: A Survey
Fenglei Fan, Jinjun Xiong, Mengzhou Li, and Ge Wang

TL;DR
This survey reviews recent research on understanding and interpreting deep neural networks, highlighting their importance for critical applications like medicine and discussing future research directions.
Contribution
It provides a comprehensive taxonomy of interpretability methods, summarizes applications in medicine, and explores future directions including fuzzy logic and brain science.
Findings
Systematic review of interpretability techniques
Applications of interpretability in medical diagnosis
Discussion of future research directions
Abstract
Deep learning as represented by the artificial deep neural networks (DNNs) has achieved great success in many important areas that deal with text, images, videos, graphs, and so on. However, the black-box nature of DNNs has become one of the primary obstacles for their wide acceptance in mission-critical applications such as medical diagnosis and therapy. Due to the huge potential of deep learning, interpreting neural networks has recently attracted much research attention. In this paper, based on our comprehensive taxonomy, we systematically review recent studies in understanding the mechanism of neural networks, describe applications of interpretability especially in medicine, and discuss future directions of interpretability research, such as in relation to fuzzy logic and brain science.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
MethodsInterpretability
