An Attention Matrix for Every Decision: Faithfulness-based Arbitration   Among Multiple Attention-Based Interpretations of Transformers in Text   Classification

Nikolaos Mylonas; Ioannis Mollas; Grigorios Tsoumakas

arXiv:2209.10876·cs.CL·November 29, 2022·1 cites

An Attention Matrix for Every Decision: Faithfulness-based Arbitration Among Multiple Attention-Based Interpretations of Transformers in Text Classification

Nikolaos Mylonas, Ioannis Mollas, Grigorios Tsoumakas

PDF

Open Access

TL;DR

This paper introduces a faithfulness-based arbitration method to select the most interpretable attention-based explanation in transformer models for text classification, improving interpretability and efficiency.

Contribution

It proposes a novel arbitration technique for selecting the most faithful attention interpretation, along with two efficiency-enhancing variations and a new faithfulness metric for transformers.

Findings

01

The method effectively identifies the most faithful attention interpretation.

02

The proposed variations reduce computational complexity and improve multi-label performance.

03

The new faithfulness metric correlates well with ground truth rationales.

Abstract

Transformers are widely used in natural language processing, where they consistently achieve state-of-the-art performance. This is mainly due to their attention-based architecture, which allows them to model rich linguistic relations between (sub)words. However, transformers are difficult to interpret. Being able to provide reasoning for its decisions is an important property for a model in domains where human lives are affected. With transformers finding wide use in such fields, the need for interpretability techniques tailored to them arises. We propose a new technique that selects the most faithful attention-based interpretation among the several ones that can be obtained by combining different head, layer and matrix operations. In addition, two variations are introduced towards (i) reducing the computational complexity, thus being faster and friendlier to the environment, and (ii)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Computational and Text Analysis Methods