Semantically Interpretable Activation Maps: what-where-how explanations within CNNs
Diego Marcos, Sylvain Lobry, Devis Tuia

TL;DR
This paper introduces Semantically Interpretable Activation Maps (SIAM), an intermediate representation for CNNs that provides transparent, attribute-based explanations of image decisions, enhancing interpretability without sacrificing performance.
Contribution
The paper proposes SIAM, a novel method that combines attribute maps for better interpretability of CNN decisions, applicable to multiple datasets without extra annotation or computational cost.
Findings
SIAM enables understanding of attribute contributions and locations in images.
SIAM improves interpretability while maintaining high prediction accuracy.
The method works effectively on landscape scenicness estimation using 33 attributes.
Abstract
A main issue preventing the use of Convolutional Neural Networks (CNN) in end user applications is the low level of transparency in the decision process. Previous work on CNN interpretability has mostly focused either on localizing the regions of the image that contribute to the result or on building an external model that generates plausible explanations. However, the former does not provide any semantic information and the latter does not guarantee the faithfulness of the explanation. We propose an intermediate representation composed of multiple Semantically Interpretable Activation Maps (SIAM) indicating the presence of predefined attributes at different locations of the image. These attribute maps are then linearly combined to produce the final output. This gives the user insight into what the model has seen, where, and a final output directly linked to this information in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
