SCAT: Robust Self-supervised Contrastive Learning via Adversarial   Training for Text Classification

Junjie Wu; Dit-Yan Yeung

arXiv:2307.01488·cs.CL·July 6, 2023

SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification

Junjie Wu, Dit-Yan Yeung

PDF

Open Access

TL;DR

SCAT introduces a label-free adversarial training framework using contrastive learning to enhance the robustness of NLP models against textual adversarial attacks, applicable to both training from scratch and improving pre-trained models.

Contribution

The paper presents SCAT, a novel self-supervised adversarial training method that does not rely on labeled data, improving robustness of NLP models against adversarial attacks.

Findings

01

SCAT effectively trains robust models from scratch.

02

It significantly enhances robustness of pre-trained language models.

03

Combining SCAT with supervised adversarial training yields further robustness improvements.

Abstract

Despite their promising performance across various natural language processing (NLP) tasks, current NLP systems are vulnerable to textual adversarial attacks. To defend against these attacks, most existing methods apply adversarial training by incorporating adversarial examples. However, these methods have to rely on ground-truth labels to generate adversarial examples, rendering it impractical for large-scale model pre-training which is commonly used nowadays for NLP and many other tasks. In this paper, we propose a novel learning framework called SCAT (Self-supervised Contrastive Learning via Adversarial Training), which can learn robust representations without requiring labeled data. Specifically, SCAT modifies random augmentations of the data in a fully labelfree manner to generate adversarial examples. Adversarial training is achieved by minimizing the contrastive loss between the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling

MethodsContrastive Learning