Adversarial Attacks on Deep Learning Models in Natural Language   Processing: A Survey

Wei Emma Zhang; Quan Z. Sheng; Ahoud Alhazmi; and Chenliang Li

arXiv:1901.06796·cs.CL·April 12, 2019·258 cites

Adversarial Attacks on Deep Learning Models in Natural Language Processing: A Survey

Wei Emma Zhang, Quan Z. Sheng, Ahoud Alhazmi, and Chenliang Li

PDF

Open Access 1 Repo

TL;DR

This survey reviews the development of adversarial attacks on deep learning models in natural language processing, highlighting unique challenges and methods for generating textual adversarial examples.

Contribution

It provides a comprehensive overview of existing research on textual adversarial attacks, emphasizing differences from image-based methods and offering insights for future work.

Findings

01

Textual adversarial attack methods are distinct due to data discreteness.

02

Various techniques for generating adversarial examples in NLP are summarized.

03

The survey discusses challenges and future directions in this research area.

Abstract

With the development of high computational devices, deep neural networks (DNNs), in recent years, have gained significant popularity in many Artificial Intelligence (AI) applications. However, previous efforts have shown that DNNs were vulnerable to strategically modified samples, named adversarial examples. These samples are generated with some imperceptible perturbations but can fool the DNNs to give false predictions. Inspired by the popularity of generating adversarial examples for image DNNs, research efforts on attacking DNNs for textual applications emerges in recent years. However, existing perturbation methods for images cannotbe directly applied to texts as text data is discrete. In this article, we review research works that address this difference and generatetextual adversarial examples on DNNs. We collect, select, summarize, discuss and analyze these works in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lihebi/biber
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications