Pretrained Transformers Do not Always Improve Robustness

Swaroop Mishra; Bhavdeep Singh Sachdeva; Chitta Baral

arXiv:2210.07663·cs.CL·October 17, 2022·1 cites

Pretrained Transformers Do not Always Improve Robustness

Swaroop Mishra, Bhavdeep Singh Sachdeva, Chitta Baral

PDF

Open Access

TL;DR

Pretrained Transformers, despite their success in Out of Distribution robustness, may be less robust than traditional models when exposed to noisy data, and adversarial filtering does not always enhance their robustness.

Contribution

This study provides an empirical comparison of PT and traditional models under noisy conditions and evaluates the effectiveness of adversarial filtering in improving robustness.

Findings

01

PT are less robust than traditional models on noisy data

02

Adversarial filtering does not always improve PT robustness

03

Noisy data can fool adversarial filtering mechanisms

Abstract

Pretrained Transformers (PT) have been shown to improve Out of Distribution (OOD) robustness than traditional models such as Bag of Words (BOW), LSTMs, Convolutional Neural Networks (CNN) powered by Word2Vec and Glove embeddings. How does the robustness comparison hold in a real world setting where some part of the dataset can be noisy? Do PT also provide more robust representation than traditional models on exposure to noisy data? We perform a comparative study on 10 models and find an empirical evidence that PT provide less robust representation than traditional models on exposure to noisy data. We investigate further and augment PT with an adversarial filtering (AF) mechanism that has been shown to improve OOD generalization. However, increase in generalization does not necessarily increase robustness, as we find that noisy data fools the AF method powered by PT.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications

MethodsGloVe Embeddings