Adversarial Self-Attention for Language Understanding

Hongqiu Wu; Ruixue Ding; Hai Zhao; Pengjun Xie; Fei Huang; and Min Zhang

arXiv:2206.12608·cs.CL·February 9, 2023·1 cites

Adversarial Self-Attention for Language Understanding

Hongqiu Wu, Ruixue Ding, Hai Zhao, Pengjun Xie, Fei Huang, and Min Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Adversarial Self-Attention (ASA), a novel mechanism that enhances Transformer models by reducing reliance on spurious features, thereby improving their robustness and generalization across various language understanding tasks.

Contribution

It proposes ASA, an adversarially biased self-attention mechanism that suppresses reliance on specific features and promotes broader semantic exploration in Transformer models.

Findings

01

ASA improves pre-training performance over naive training.

02

ASA-empowered models outperform naive models in generalization.

03

Enhanced robustness and robustness in language understanding tasks.

Abstract

Deep neural models (e.g. Transformer) naturally learn spurious features, which create a ``shortcut'' between the labels and inputs, thus impairing the generalization and robustness. This paper advances the self-attention mechanism to its robust variant for Transformer-based pre-trained language models (e.g. BERT). We propose \textit{Adversarial Self-Attention} mechanism (ASA), which adversarially biases the attentions to effectively suppress the model reliance on features (e.g. specific keywords) and encourage its exploration of broader semantics. We conduct a comprehensive evaluation across a wide range of tasks for both pre-training and fine-tuning stages. For pre-training, ASA unfolds remarkable performance gains compared to naive training for longer steps. For fine-tuning, ASA-empowered models outweigh naive models by a large margin considering both generalization and robustness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gingasan/adversarialsa
pytorchOfficial

Videos

Adversarial Self-Attention for Language Understanding· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Adam · Label Smoothing · Dropout · Byte Pair Encoding · Layer Normalization · Position-Wise Feed-Forward Layer