Generic Attention-model Explainability for Interpreting Bi-Modal and   Encoder-Decoder Transformers

Hila Chefer; Shir Gur; and Lior Wolf

arXiv:2103.15679·cs.CV·March 30, 2021

Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers

Hila Chefer, Shir Gur, and Lior Wolf

PDF

1 Repo

TL;DR

This paper introduces a universal explainability method for Transformer-based models, including multi-modal and encoder-decoder architectures, improving interpretability over existing single-modality approaches.

Contribution

It presents the first generic explanation technique applicable to various Transformer architectures, including bi-modal and co-attention models.

Findings

01

Outperforms existing explainability methods for Transformers.

02

Effective across self-attention, co-attention, and encoder-decoder models.

03

Enhances interpretability in multi-modal reasoning tasks.

Abstract

Transformers are increasingly dominating multi-modal reasoning tasks, such as visual question answering, achieving state-of-the-art results thanks to their ability to contextualize information using the self-attention and co-attention mechanisms. These attention modules also play a role in other computer vision tasks including object detection and image segmentation. Unlike Transformers that only use self-attention, Transformers with co-attention require to consider multiple attention maps in parallel in order to highlight the information that is relevant to the prediction in the model's input. In this work, we propose the first method to explain prediction by any Transformer-based architecture, including bi-modal Transformers and Transformers with co-attentions. We provide generic solutions and apply these to the three most commonly used of these architectures: (i) pure self-attention,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hila-chefer/Transformer-MM-Explainability
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.