Inpainting the Gaps: A Novel Framework for Evaluating Explanation   Methods in Vision Transformers

Lokesh Badisa; Sumohana S. Channappayya

arXiv:2406.11534·cs.CV·June 18, 2024

Inpainting the Gaps: A Novel Framework for Evaluating Explanation Methods in Vision Transformers

Lokesh Badisa, Sumohana S. Channappayya

PDF

Open Access

TL;DR

This paper introduces InG, a new semi-synthetic evaluation framework for explanation methods in Vision Transformers that reduces test-time distribution shift by inpainting image gaps, leading to more reliable assessments.

Contribution

The paper proposes InG, a novel inpainting-based evaluation framework for ViT explanation methods, addressing limitations of perturbation tests and providing more consistent evaluation scores.

Findings

01

InG improves the reliability of explanation method evaluation.

02

Beyond Intuition and Generic Attribution are the most consistent explanation models.

03

InG yields higher and more stable evaluation scores across ViT models.

Abstract

The perturbation test remains the go-to evaluation approach for explanation methods in computer vision. This evaluation method has a major drawback of test-time distribution shift due to pixel-masking that is not present in the training set. To overcome this drawback, we propose a novel evaluation framework called \textbf{Inpainting the Gaps (InG)}. Specifically, we propose inpainting parts that constitute partial or complete objects in an image. In this way, one can perform meaningful image perturbations with lower test-time distribution shifts, thereby improving the efficacy of the perturbation test. InG is applied to the PartImageNet dataset to evaluate the performance of popular explanation methods for three training strategies of the Vision Transformer (ViT). Based on this evaluation, we found Beyond Intuition and Generic Attribution to be the two most consistent explanation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning

MethodsResidual Connection · Softmax · Inpainting · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Linear Layer · Multi-Head Attention