Interpretable Network Visualizations: A Human-in-the-Loop Approach for Post-hoc Explainability of CNN-based Image Classification

Matteo Bianchi; Antonio De Santis; Andrea Tocchetti; Marco Brambilla

arXiv:2405.03301 · cs.LG · July 30, 2024

TL;DR

This paper presents a human-in-the-loop, post-hoc explainability method for CNNs that visualizes layer-wise features, incorporates crowdsourced labels, and offers global explanations to improve transparency in image classification.

Contribution

It introduces a novel approach combining saliency map clustering, crowdsourced textual labels, and aggregation techniques for comprehensive CNN interpretability.

Findings

01. Layer-wise feature explanations improve model transparency.
02. Crowdsourced labels enhance interpretability with human insights.
03. Global explanations help understand model behavior across datasets.

Abstract

Transparency and explainability in image classification are essential for establishing trust in machine learning models and detecting biases and errors. State-of-the-art explainability methods generate saliency maps to show where a specific class is identified, without providing a detailed explanation of the model's decision process. Striving to address such a need, we introduce a post-hoc method that explains the entire feature extraction process of a Convolutional Neural Network. These explanations include a layer-wise representation of the features the model extracts from the input. Such features are represented as saliency maps generated by clustering and merging similar feature maps, to which we associate a weight derived by generalizing Grad-CAM for the proposed methodology. To further enhance these explanations, we include a set of textual labels collected through a gamified…
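The clustering-and-merging step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `cluster_feature_maps`, the use of a plain k-means on L2-normalised feature maps, merging by averaging, and scoring each cluster by the sum of its channels' Grad-CAM weights are all assumptions made for illustration. The only part taken from standard Grad-CAM is the per-channel weight, computed as the spatial mean of the gradient of the class score with respect to that channel.

```python
import numpy as np

def cluster_feature_maps(feature_maps, gradients, n_clusters=3, n_iter=20, seed=0):
    """Group similar feature maps of one conv layer and merge each group.

    feature_maps, gradients: arrays of shape (C, H, W); gradients holds the
    derivative of the class score with respect to each activation.
    Returns (saliency, weights, labels). Hypothetical helper, for illustration.
    """
    C, H, W = feature_maps.shape
    flat = feature_maps.reshape(C, -1).astype(float)
    # L2-normalise so clustering compares spatial patterns, not magnitudes.
    norms = np.linalg.norm(flat, axis=1, keepdims=True)
    unit = flat / np.maximum(norms, 1e-12)

    # Plain k-means (an assumed choice of clustering algorithm).
    rng = np.random.default_rng(seed)
    centers = unit[rng.choice(C, size=n_clusters, replace=False)].copy()
    for _ in range(n_iter):
        dists = ((unit[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for k in range(n_clusters):
            members = unit[labels == k]
            if len(members) > 0:
                centers[k] = members.mean(axis=0)

    # Standard Grad-CAM channel weights: global-average-pooled gradients.
    alphas = gradients.reshape(C, -1).mean(axis=1)

    saliency = np.zeros((n_clusters, H, W))
    weights = np.zeros(n_clusters)
    for k in range(n_clusters):
        idx = np.flatnonzero(labels == k)
        if idx.size == 0:
            continue  # empty cluster: leave a zero map and zero weight
        # Merge by averaging, keep positive evidence only, scale to [0, 1].
        merged = np.maximum(feature_maps[idx].mean(axis=0), 0.0)
        peak = merged.max()
        saliency[k] = merged / peak if peak > 0 else merged
        # Assumed generalisation: a cluster's weight is the sum of its
        # members' Grad-CAM weights.
        weights[k] = alphas[idx].sum()
    return saliency, weights, labels
```

In practice `feature_maps` and `gradients` would come from a forward and backward pass through the CNN for a chosen class; here they are treated as given arrays so the sketch stays framework-independent.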

Code & Models

Repositories

Antonio-Dee/interpretable-network-visualizations (Official)

Taxonomy

Methods: Sparse Evolutionary Training