Adversarial Training Methods for Semi-Supervised Text Classification

Takeru Miyato; Andrew M. Dai; Ian Goodfellow

arXiv:1605.07725 · stat.ML · November 17, 2021 · 201 cites

TL;DR

This paper adapts adversarial and virtual adversarial training to semi-supervised text classification by perturbing word embeddings rather than the sparse one-hot inputs, which improves both classification performance and the quality of the learned embeddings.

Contribution

The paper introduces a method for applying adversarial and virtual adversarial training to the word embeddings of a recurrent neural network, which improves semi-supervised text classification.

Findings

01. Achieved state-of-the-art results on benchmark tasks.

02. Learned embeddings show improved quality.

03. Model exhibits reduced overfitting during training.

Abstract

Adversarial training provides a means of regularizing supervised learning algorithms while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting. However, both methods require making small perturbations to numerous entries of the input vector, which is inappropriate for sparse high-dimensional inputs such as one-hot word representations. We extend adversarial and virtual adversarial training to the text domain by applying perturbations to the word embeddings in a recurrent neural network rather than to the original input itself. The proposed method achieves state of the art results on multiple benchmark semi-supervised and purely supervised tasks. We provide visualizations and analysis showing that the learned word embeddings have improved in quality and that while training, the model is less prone to overfitting. Code is available…
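
The core idea in the abstract, perturbing the embedded sequence instead of the one-hot input, can be illustrated with a minimal sketch. It assumes a perturbation of the form r_adv = epsilon * g / ||g||_2, where g is the gradient of the loss with respect to the embeddings. To keep the gradient hand-derivable, the sketch swaps the paper's recurrent network for a mean-pooled linear classifier. All names here (`loss_and_grad`, `adversarial_perturbation`, `embedding_table`, the sizes, and `epsilon=0.5`) are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def loss_and_grad(emb, w, b, y):
    """Binary cross-entropy of a mean-pooled linear classifier.

    emb: (T, D) embedded token sequence, w: (D,) weights, b: bias, y: 0 or 1.
    Returns the loss and its gradient with respect to emb.
    NOTE: this toy classifier stands in for the paper's RNN (an assumption
    made so the gradient can be written analytically).
    """
    T = emb.shape[0]
    p = sigmoid(emb.mean(axis=0) @ w + b)
    loss = -(y * np.log(p) + (1 - y) * np.log(1 - p))
    # d loss / d emb[t] = (p - y) * w / T, identical for every time step t.
    grad = np.tile((p - y) * w / T, (T, 1))
    return loss, grad


def adversarial_perturbation(grad, epsilon):
    """Scale the gradient to L2 norm epsilon over the whole sequence."""
    norm = np.linalg.norm(grad)
    return epsilon * grad / (norm + 1e-12)


# Hypothetical toy setup: 10-word vocabulary, 4-dimensional embeddings.
rng = np.random.default_rng(0)
embedding_table = rng.normal(size=(10, 4))
w = rng.normal(size=4)
b = 0.0
token_ids = np.array([1, 5, 7])
y = 1

# Look up dense embeddings; the perturbation is added here, not to one-hot ids.
emb = embedding_table[token_ids]
clean_loss, grad = loss_and_grad(emb, w, b, y)
r_adv = adversarial_perturbation(grad, epsilon=0.5)
adv_loss, _ = loss_and_grad(emb + r_adv, w, b, y)

print("clean loss:", clean_loss)
print("adversarial loss:", adv_loss)
```

In adversarial training, the loss on `emb + r_adv` would be added to the clean loss as a regularizer. The perturbation has the same shape as the dense embedded sequence, so every entry can shift by a small amount, which is not meaningful for a one-hot vector.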

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis