Unsupervised discovery of Interpretable Visual Concepts
Caroline Mazini Rodrigues (LIGM, LRDE), Nicolas Boutry (LRDE), Laurent, Najman (LIGM)

TL;DR
This paper introduces two novel methods, MAGE and Ms-IV, to enhance the interpretability of CNNs by discovering and visualizing semantic concepts, outperforming existing xAI techniques in localization and faithfulness.
Contribution
The paper presents MAGE and Ms-IV, new techniques for unsupervised discovery and visualization of interpretable visual concepts in CNNs, improving global interpretability.
Findings
Ms-IV achieves higher localization accuracy.
The combined approach helps detect biases in networks.
Qualitative analysis shows human agreement with visualized concepts.
Abstract
Providing interpretability of deep-learning models to non-experts, while fundamental for a responsible real-world usage, is challenging. Attribution maps from xAI techniques, such as Integrated Gradients, are a typical example of a visualization technique containing a high level of information, but with difficult interpretation. In this paper, we propose two methods, Maximum Activation Groups Extraction (MAGE) and Multiscale Interpretable Visualization (Ms-IV), to explain the model's decision, enhancing global interpretability. MAGE finds, for a given CNN, combinations of features which, globally, form a semantic meaning, that we call concepts. We group these similar feature patterns by clustering in ``concepts'', that we visualize through Ms-IV. This last method is inspired by Occlusion and Sensitivity analysis (incorporating causality), and uses a novel metric, called Class-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Neural Networks and Applications
MethodsLocal Interpretable Model-Agnostic Explanations
