I Bet You Did Not Mean That: Testing Semantic Importance via Betting

Jacopo Teneggi; Jeremias Sulam

arXiv:2405.19146·stat.ML·October 8, 2024

I Bet You Did Not Mean That: Testing Semantic Importance via Betting

Jacopo Teneggi, Jeremias Sulam

PDF

Open Access 1 Repo

TL;DR

This paper introduces a statistically rigorous framework for testing the importance of semantic concepts in black-box models, ensuring transparent and reliable interpretation through independence testing and significance control.

Contribution

It formalizes the statistical importance of semantic concepts using conditional independence and applies sequential kernelized independence testing to rank and validate concept importance.

Findings

01

Effective importance ranking on synthetic datasets

02

Successful application to diverse vision-language models

03

Framework provides statistical guarantees for interpretability

Abstract

Recent works have extended notions of feature importance to semantic concepts that are inherently interpretable to the users interacting with a black-box predictive model. Yet, precise statistical guarantees, such as false positive rate and false discovery rate control, are needed to communicate findings transparently and to avoid unintended consequences in real-world scenarios. In this paper, we formalize the global (i.e., over a population) and local (i.e., for a sample) statistical importance of semantic concepts for the predictions of opaque models by means of conditional independence, which allows for rigorous testing. We use recent ideas of sequential kernelized independence testing (SKIT) to induce a rank of importance across concepts, and showcase the effectiveness and flexibility of our framework on synthetic datasets as well as on image classification tasks using several and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Sulam-Group/IBYDMT
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques · Software Engineering Research

MethodsContrastive Language-Image Pre-training