Open Vocabulary Compositional Explanations for Neuron Alignment

Biagio La Rosa; Leilani H. Gilpin

arXiv:2511.20931·cs.CV·November 27, 2025

Open Vocabulary Compositional Explanations for Neuron Alignment

Biagio La Rosa, Leilani H. Gilpin

PDF

Open Access

TL;DR

This paper introduces a flexible framework for generating open vocabulary compositional explanations of neuron activations in vision models, enabling analysis with arbitrary concepts beyond predefined datasets.

Contribution

It presents a novel method leveraging open vocabulary semantic segmentation masks to produce compositional explanations without relying on human-annotated datasets.

Findings

01

Framework outperforms previous methods in quantitative metrics

02

Enhances human interpretability of neuron explanations

03

Allows probing neurons with arbitrary concepts and datasets

Abstract

Neurons are the fundamental building blocks of deep neural networks, and their interconnections allow AI to achieve unprecedented results. Motivated by the goal of understanding how neurons encode information, compositional explanations leverage logical relationships between concepts to express the spatial alignment between neuron activations and human knowledge. However, these explanations rely on human-annotated datasets, restricting their applicability to specific domains and predefined concepts. This paper addresses this limitation by introducing a framework for the vision domain that allows users to probe neurons for arbitrary concepts and datasets. Specifically, the framework leverages masks generated by open vocabulary semantic segmentation to compute open vocabulary compositional explanations. The proposed framework consists of three steps: specifying arbitrary concepts,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · Advanced Neural Network Applications