Visual Interpretability for Deep Learning: a Survey

Quanshi Zhang; Song-Chun Zhu

arXiv:1802.00614·cs.CV·February 8, 2018·32 cites

Visual Interpretability for Deep Learning: a Survey

Quanshi Zhang, Song-Chun Zhu

PDF

Open Access 1 Repo

TL;DR

This survey reviews recent advances in understanding and improving the interpretability of deep neural networks, especially CNNs, highlighting visualization, diagnosis, disentanglement, and future trends in explainable AI.

Contribution

It provides a comprehensive overview of methods for interpreting CNN representations and discusses future directions in explainable deep learning.

Findings

01

Visualization techniques help interpret CNN features

02

Disentangling representations improves model transparency

03

Future trends include explainable AI development

Abstract

This paper reviews recent studies in understanding neural-network representations and learning neural networks with interpretable/disentangled middle-layer representations. Although deep neural networks have exhibited superior performance in various tasks, the interpretability is always the Achilles' heel of deep neural networks. At present, deep neural networks obtain high discrimination power at the cost of low interpretability of their black-box representations. We believe that high model interpretability may help people to break several bottlenecks of deep learning, e.g., learning from very few annotations, learning via human-computer communications at the semantic level, and semantically debugging network representations. We focus on convolutional neural networks (CNNs), and we revisit the visualization of CNN representations, methods of diagnosing representations of pre-trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JepsonWong/CNN_Visualization
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsInterpretability