Exploiting Text-Image Latent Spaces for the Description of Visual Concepts
Laines Schmalwasser, Jakob Gawlikowski, Joachim Denzler, Julia, Niebling

TL;DR
This paper presents a method to generate textual descriptions for Concept Activation Vectors in neural networks by mapping relevant images into a joint text-image embedding, aiding interpretability.
Contribution
It introduces a novel approach that uses receptive fields and text-image embeddings to automatically describe CAVs, improving concept interpretability in neural networks.
Findings
The method accurately describes CAVs in various experiments.
It reduces the effort needed for human interpretation of concepts.
The approach works with and without pre-labeled CAVs.
Abstract
Concept Activation Vectors (CAVs) offer insights into neural network decision-making by linking human friendly concepts to the model's internal feature extraction process. However, when a new set of CAVs is discovered, they must still be translated into a human understandable description. For image-based neural networks, this is typically done by visualizing the most relevant images of a CAV, while the determination of the concept is left to humans. In this work, we introduce an approach to aid the interpretation of newly discovered concept sets by suggesting textual descriptions for each CAV. This is done by mapping the most relevant images representing a CAV into a text-image embedding where a joint description of these relevant images can be computed. We propose utilizing the most relevant receptive fields instead of full images encoded. We demonstrate the capabilities of this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Image Retrieval and Classification Techniques · Video Analysis and Summarization
MethodsSparse Evolutionary Training
