On the Performance of Concept Probing: The Influence of the Data (Extended Version)

Manuel de Sousa Ribeiro; Afonso Leote; Jo\~ao Leite

arXiv:2507.18550·cs.AI·July 25, 2025

On the Performance of Concept Probing: The Influence of the Data (Extended Version)

Manuel de Sousa Ribeiro, Afonso Leote, Jo\~ao Leite

PDF

Open Access

TL;DR

This paper investigates how the choice and amount of data influence the effectiveness of concept probing in interpreting neural networks, specifically in image classification, and provides new concept labels for popular datasets.

Contribution

It explores the impact of training data on concept probing performance and offers publicly available concept labels for key datasets.

Findings

01

Data quality and quantity significantly affect probing accuracy

02

Probing models trained on more representative data perform better

03

New concept labels for standard datasets are introduced

Abstract

Concept probing has recently garnered increasing interest as a way to help interpret artificial neural networks, dealing both with their typically large size and their subsymbolic nature, which ultimately renders them unfeasible for direct human interpretation. Concept probing works by training additional classifiers to map the internal representations of a model into human-defined concepts of interest, thus allowing humans to peek inside artificial neural networks. Research on concept probing has mainly focused on the model being probed or the probing model itself, paying limited attention to the data required to train such probing models. In this paper, we address this gap. Focusing on concept probing in the context of image classification tasks, we investigate the effect of the data used to train probing models on their performance. We also make available concept labels for two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Data Stream Mining Techniques · Machine Learning and Data Classification