BERT is Robust! A Case Against Synonym-Based Adversarial Examples in   Text Classification

Jens Hauser; Zhao Meng; Dami\'an Pascual; Roger Wattenhofer

arXiv:2109.07403·cs.CL·September 16, 2021·1 cites

BERT is Robust! A Case Against Synonym-Based Adversarial Examples in Text Classification

Jens Hauser, Zhao Meng, Dami\'an Pascual, Roger Wattenhofer

PDF

Open Access

TL;DR

This paper challenges the perceived vulnerability of BERT to synonym-based adversarial attacks, showing many attacks do not preserve semantics and that BERT is more robust than previously thought.

Contribution

It demonstrates that most synonym-based attacks fail to preserve semantics and introduces data augmentation and post-processing methods to significantly improve BERT's robustness.

Findings

01

96-99% of attacks do not preserve semantics

02

Data augmentation reduces attack success below 5%

03

BERT is more robust than prior attack studies suggest

Abstract

Deep Neural Networks have taken Natural Language Processing by storm. While this led to incredible improvements across many tasks, it also initiated a new research field, questioning the robustness of these neural networks by attacking them. In this paper, we investigate four word substitution-based attacks on BERT. We combine a human evaluation of individual word substitutions and a probabilistic analysis to show that between 96% and 99% of the analyzed attacks do not preserve semantics, indicating that their success is mainly based on feeding poor data to the model. To further confirm that, we introduce an efficient data augmentation procedure and show that many adversarial examples can be prevented by including data similar to the attacks during training. An additional post-processing step reduces the success rates of state-of-the-art attacks below 5%. Finally, by looking at more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Software Engineering Research

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam · Residual Connection · Multi-Head Attention · Softmax