SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification
Junjie Wu, Dit-Yan Yeung

TL;DR
SCAT introduces a label-free adversarial training framework using contrastive learning to enhance the robustness of NLP models against textual adversarial attacks, applicable to both training from scratch and improving pre-trained models.
Contribution
The paper presents SCAT, a novel self-supervised adversarial training method that does not rely on labeled data, improving robustness of NLP models against adversarial attacks.
Findings
SCAT effectively trains robust models from scratch.
It significantly enhances robustness of pre-trained language models.
Combining SCAT with supervised adversarial training yields further robustness improvements.
Abstract
Despite their promising performance across various natural language processing (NLP) tasks, current NLP systems are vulnerable to textual adversarial attacks. To defend against these attacks, most existing methods apply adversarial training by incorporating adversarial examples. However, these methods have to rely on ground-truth labels to generate adversarial examples, rendering it impractical for large-scale model pre-training which is commonly used nowadays for NLP and many other tasks. In this paper, we propose a novel learning framework called SCAT (Self-supervised Contrastive Learning via Adversarial Training), which can learn robust representations without requiring labeled data. Specifically, SCAT modifies random augmentations of the data in a fully labelfree manner to generate adversarial examples. Adversarial training is achieved by minimizing the contrastive loss between the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Topic Modeling
MethodsContrastive Learning
