Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

Hengyi Wang; Shiwei Tan; Hao Wang

arXiv:2406.12649 · cs.LG · November 4, 2024

TL;DR

This paper introduces PACE, a probabilistic framework that models the distributions of patch embeddings to provide trustworthy, multi-level conceptual explanations of vision transformers, addressing shortcomings of existing explanation methods.

Contribution

The paper proposes PACE, a novel variational Bayesian approach that models patch embedding distributions for more faithful and stable explanations of ViT predictions.
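
To make the idea of modeling patch embedding distributions concrete, here is a minimal sketch. It is not the PACE model: as a stand-in it fits a plain spherical Gaussian mixture with EM, and it uses random arrays in place of real ViT patch embeddings. The function names (`fit_concepts`, `explain_levels`) and all parameters are hypothetical. It only illustrates how per-patch concept probabilities could be aggregated into image-level and dataset-level summaries.

```python
import numpy as np


def fit_concepts(patches, num_concepts, num_iters=50, seed=0):
    """Fit a spherical Gaussian mixture to patch embeddings with EM.

    Assumption: a simple stand-in for a probabilistic concept model,
    not the variational Bayesian model described in the paper.
    patches: array of shape (num_images, num_patches, dim).
    Returns (means, resp), where resp has shape
    (num_images, num_patches, num_concepts).
    """
    rng = np.random.default_rng(seed)
    n_img, n_patch, dim = patches.shape
    x = patches.reshape(-1, dim)

    # Initialise concept centres from randomly chosen patches.
    means = x[rng.choice(len(x), num_concepts, replace=False)]
    var = np.full(num_concepts, x.var())
    weights = np.full(num_concepts, 1.0 / num_concepts)

    for _ in range(num_iters):
        # E-step: posterior probability of each concept for each patch.
        sq = ((x[:, None, :] - means[None]) ** 2).sum(-1)
        log_p = (np.log(weights)
                 - 0.5 * dim * np.log(2 * np.pi * var)
                 - 0.5 * sq / var)
        log_p -= log_p.max(axis=1, keepdims=True)
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)

        # M-step: re-estimate centres, variances, and mixing weights.
        nk = resp.sum(0) + 1e-9
        means = (resp.T @ x) / nk[:, None]
        sq = ((x[:, None, :] - means[None]) ** 2).sum(-1)
        var = (resp * sq).sum(0) / (nk * dim) + 1e-6
        weights = nk / nk.sum()

    return means, resp.reshape(n_img, n_patch, num_concepts)


def explain_levels(patches, num_concepts):
    """Return patch-, image-, and dataset-level concept distributions."""
    _, patch_level = fit_concepts(patches, num_concepts)
    image_level = patch_level.mean(axis=1)    # average over patches
    dataset_level = image_level.mean(axis=0)  # average over images
    return patch_level, image_level, dataset_level


# Toy usage: 4 "images", 16 patches each, 8-dimensional embeddings.
toy = np.random.default_rng(1).normal(size=(4, 16, 8))
patch_level, image_level, dataset_level = explain_levels(toy, num_concepts=3)
```

Averaging the posteriors is only one possible aggregation. It is used here because it keeps every level a valid probability distribution over concepts.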

Findings

1. PACE outperforms existing methods on synthetic datasets.
2. It provides more faithful and stable explanations.
3. It bridges image-level and dataset-level explanations.

Abstract

Vision transformers (ViTs) have emerged as a significant area of focus, particularly for their capacity to be jointly trained with large language models and to serve as robust vision foundation models. Yet, the development of trustworthy explanation methods for ViTs has lagged, particularly in the context of post-hoc interpretations of ViT predictions. Existing sub-image selection approaches, such as feature-attribution and conceptual models, fall short in this regard. This paper proposes five desiderata for explaining ViTs -- faithfulness, stability, sparsity, multi-level structure, and parsimony -- and demonstrates the inadequacy of current methods in meeting these criteria comprehensively. We introduce a variational Bayesian explanation framework, dubbed ProbAbilistic Concept Explainers (PACE), which models the distributions of patch embeddings to provide trustworthy post-hoc…

Code & Models

Repositories

Wang-ML-Lab/interpretable-foundation-models (PyTorch, official)

Taxonomy

Topics: Explainable Artificial Intelligence (XAI)