Adversarial Analysis of Natural Language Inference Systems
Tiffany Chien, Jugal Kalita

TL;DR
This paper examines the vulnerabilities of state-of-the-art natural language inference models to adversarial attacks, revealing significant weaknesses especially in syntax understanding, and offers insights for improving robustness.
Contribution
It provides a comprehensive evaluation of model robustness against adversarial datasets and analyzes factors like model size and training methods affecting performance.
Findings
Syntax attacks are highly effective across models.
Models trained on adversarial data show limited transferability.
Larger models and multi-task learning improve robustness.
Abstract
The release of large natural language inference (NLI) datasets like SNLI and MNLI have led to rapid development and improvement of completely neural systems for the task. Most recently, heavily pre-trained, Transformer-based models like BERT and MT-DNN have reached near-human performance on these datasets. However, these standard datasets have been shown to contain many annotation artifacts, allowing models to shortcut understanding using simple fallible heuristics, and still perform well on the test set. So it is no surprise that many adversarial (challenge) datasets have been created that cause models trained on standard datasets to fail dramatically. Although extra training on this data generally improves model performance on just that type of data, transferring that learning to unseen examples is still partial at best. This work evaluates the failures of state-of-the-art models on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest · Linear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece
