Open Vocabulary Compositional Explanations for Neuron Alignment
Biagio La Rosa, Leilani H. Gilpin

TL;DR
This paper introduces a flexible framework for generating open vocabulary compositional explanations of neuron activations in vision models, enabling analysis with arbitrary concepts beyond predefined datasets.
Contribution
It presents a novel method leveraging open vocabulary semantic segmentation masks to produce compositional explanations without relying on human-annotated datasets.
Findings
Framework outperforms previous methods in quantitative metrics
Enhances human interpretability of neuron explanations
Allows probing neurons with arbitrary concepts and datasets
Abstract
Neurons are the fundamental building blocks of deep neural networks, and their interconnections allow AI to achieve unprecedented results. Motivated by the goal of understanding how neurons encode information, compositional explanations leverage logical relationships between concepts to express the spatial alignment between neuron activations and human knowledge. However, these explanations rely on human-annotated datasets, restricting their applicability to specific domains and predefined concepts. This paper addresses this limitation by introducing a framework for the vision domain that allows users to probe neurons for arbitrary concepts and datasets. Specifically, the framework leverages masks generated by open vocabulary semantic segmentation to compute open vocabulary compositional explanations. The proposed framework consists of three steps: specifying arbitrary concepts,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · Advanced Neural Network Applications
