SEAT: Stable and Explainable Attention

Lijie Hu; Yixin Liu; Ninghao Liu; Mengdi Huai; Lichao Sun; Di Wang

arXiv:2211.13290·cs.CL·November 28, 2022

SEAT: Stable and Explainable Attention

Lijie Hu, Yixin Liu, Ninghao Liu, Mengdi Huai, Lichao Sun, Di Wang

PDF

Open Access 1 Video

TL;DR

This paper introduces SEAT, a new attention mechanism that is stable against randomness and perturbations, providing more reliable explanations in NLP models without sacrificing accuracy.

Contribution

The paper proposes SEAT, a novel stable and explainable attention mechanism, with a rigorous definition and empirical validation across multiple models and datasets.

Findings

01

SEAT is more stable against perturbations and randomness.

02

SEAT maintains explainability comparable to vanilla attention.

03

SEAT does not degrade model accuracy.

Abstract

Currently, attention mechanism becomes a standard fixture in most state-of-the-art natural language processing (NLP) models, not only due to outstanding performance it could gain, but also due to plausible innate explanation for the behaviors of neural architectures it provides, which is notoriously difficult to analyze. However, recent studies show that attention is unstable against randomness and perturbations during training or testing, such as random seeds and slight perturbation of embedding vectors, which impedes it from becoming a faithful explanation tool. Thus, a natural question is whether we can find some substitute of the current attention which is more stable and could keep the most important characteristics on explanation and prediction of attention. In this paper, to resolve the problem, we provide a first rigorous definition of such alternate namely SEAT (Stable and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SEAT: Stable and Explainable Attention· underline

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Adversarial Robustness in Machine Learning

MethodsMulti-Head Attention · Attention Is All You Need · Sigmoid Activation · Tanh Activation · Linear Layer · Weight Decay · Dense Connections · Residual Connection · Layer Normalization · WordPiece