Interpreting Deep Visual Representations via Network Dissection

Bolei Zhou; David Bau; Aude Oliva; and Antonio Torralba

arXiv:1711.05611·cs.CV·June 27, 2018·45 cites

Interpreting Deep Visual Representations via Network Dissection

Bolei Zhou, David Bau, Aude Oliva, and Antonio Torralba

PDF

Open Access 3 Repos

TL;DR

Network Dissection is a method that interprets deep CNN representations by labeling individual units with human-understandable concepts, revealing their transparency and aiding explanation of network decisions.

Contribution

The paper introduces Network Dissection, a novel approach to interpret CNN units by aligning them with semantic concepts, enhancing understanding of deep visual representations.

Findings

01

Deep representations are more interpretable than random baselines.

02

Interpretability varies with training factors like iterations and regularizations.

03

Interpreted units can explain CNN predictions explicitly.

Abstract

The success of recent deep convolutional neural networks (CNNs) depends on learning hidden representations that can summarize the important factors of variation behind the data. However, CNNs often criticized as being black boxes that lack interpretability, since they have millions of unexplained model parameters. In this work, we describe Network Dissection, a method that interprets networks by providing labels for the units of their deep visual representations. The proposed method quantifies the interpretability of CNN representations by evaluating the alignment between individual hidden units and a set of visual semantic concepts. By identifying the best alignments, units are given human interpretable labels across a range of objects, parts, scenes, textures, materials, and colors. The method reveals that deep representations are more transparent and interpretable than expected: we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques · Machine Learning in Materials Science

MethodsNetwork Dissection · Interpretability