Better Robustness by More Coverage: Adversarial Training with Mixup   Augmentation for Robust Fine-tuning

Chenglei Si; Zhengyan Zhang; Fanchao Qi; Zhiyuan Liu; Yasheng Wang,; Qun Liu; Maosong Sun

arXiv:2012.15699·cs.CL·June 8, 2021·20 cites

Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning

Chenglei Si, Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu, Yasheng Wang,, Qun Liu, Maosong Sun

PDF

Open Access 1 Repo

TL;DR

This paper introduces AMDA, a novel data augmentation method combining adversarial training with mixup to enhance the robustness of pretrained language models against adversarial attacks, showing significant improvements in experiments.

Contribution

The paper proposes AMDA, a new augmentation technique that interpolates between training samples to better cover attack space and improve model robustness.

Findings

01

AMDA significantly improves adversarial robustness of BERT and RoBERTa.

02

AMDA alleviates performance degradation on clean data.

03

The method outperforms traditional adversarial data augmentation.

Abstract

Pretrained language models (PLMs) perform poorly under adversarial attacks. To improve the adversarial robustness, adversarial data augmentation (ADA) has been widely adopted to cover more search space of adversarial attacks by adding textual adversarial examples during training. However, the number of adversarial examples for text augmentation is still extremely insufficient due to the exponentially large attack search space. In this work, we propose a simple and effective method to cover a much larger proportion of the attack search space, called Adversarial and Mixup Data Augmentation (AMDA). Specifically, AMDA linearly interpolates the representations of pairs of training samples to form new virtual samples, which are more abundant and diverse than the discrete text adversarial examples in conventional ADA. Moreover, to fairly evaluate the robustness of different models, we adopt a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thunlp/MixADA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Natural Language Processing Techniques

MethodsAdaptive Discriminator Augmentation · Linear Layer · Dropout · Softmax · Linear Warmup With Linear Decay · Dense Connections · Attention Dropout · Attention Is All You Need · Layer Normalization · WordPiece