Visual Interpretability for Deep Learning: a Survey
Quanshi Zhang, Song-Chun Zhu

TL;DR
This survey reviews recent advances in understanding and improving the interpretability of deep neural networks, especially CNNs, highlighting visualization, diagnosis, disentanglement, and future trends in explainable AI.
Contribution
It provides a comprehensive overview of methods for interpreting CNN representations and discusses future directions in explainable deep learning.
Findings
Visualization techniques help interpret CNN features
Disentangling representations improves model transparency
Future trends include explainable AI development
Abstract
This paper reviews recent studies in understanding neural-network representations and learning neural networks with interpretable/disentangled middle-layer representations. Although deep neural networks have exhibited superior performance in various tasks, the interpretability is always the Achilles' heel of deep neural networks. At present, deep neural networks obtain high discrimination power at the cost of low interpretability of their black-box representations. We believe that high model interpretability may help people to break several bottlenecks of deep learning, e.g., learning from very few annotations, learning via human-computer communications at the semantic level, and semantically debugging network representations. We focus on convolutional neural networks (CNNs), and we revisit the visualization of CNN representations, methods of diagnosing representations of pre-trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsInterpretability
