Interpreting Deep Visual Representations via Network Dissection
Bolei Zhou, David Bau, Aude Oliva, and Antonio Torralba

TL;DR
Network Dissection is a method that interprets deep CNN representations by labeling individual units with human-understandable concepts, revealing their transparency and aiding explanation of network decisions.
Contribution
The paper introduces Network Dissection, a novel approach to interpret CNN units by aligning them with semantic concepts, enhancing understanding of deep visual representations.
Findings
Deep representations are more interpretable than random baselines.
Interpretability varies with training factors like iterations and regularizations.
Interpreted units can explain CNN predictions explicitly.
Abstract
The success of recent deep convolutional neural networks (CNNs) depends on learning hidden representations that can summarize the important factors of variation behind the data. However, CNNs often criticized as being black boxes that lack interpretability, since they have millions of unexplained model parameters. In this work, we describe Network Dissection, a method that interprets networks by providing labels for the units of their deep visual representations. The proposed method quantifies the interpretability of CNN representations by evaluating the alignment between individual hidden units and a set of visual semantic concepts. By identifying the best alignments, units are given human interpretable labels across a range of objects, parts, scenes, textures, materials, and colors. The method reveals that deep representations are more transparent and interpretable than expected: we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques · Machine Learning in Materials Science
MethodsNetwork Dissection · Interpretability
