Efficient Combinatorial Optimization for Word-level Adversarial Textual Attack
Shengcai Liu, Ning Lu, Cheng Chen, Ke Tang

TL;DR
This paper introduces a theoretically grounded, efficient local search algorithm for word-level adversarial attacks in NLP, significantly reducing query complexity and improving attack quality across multiple tasks and models.
Contribution
It uncovers the theoretical properties of the attack optimization problem and proposes the first provably approximate local search algorithm for it.
Findings
Reduces query count by an order of magnitude
Achieves high attack success rates
Produces higher quality adversarial examples
Abstract
Over the past few years, various word-level textual attack approaches have been proposed to reveal the vulnerability of deep neural networks used in natural language processing. Typically, these approaches involve an important optimization step to determine which substitute to be used for each word in the original input. However, current research on this step is still rather limited, from the perspectives of both problem-understanding and problem-solving. In this paper, we address these issues by uncovering the theoretical properties of the problem and proposing an efficient local search algorithm (LS) to solve it. We establish the first provable approximation guarantee on solving the problem in general cases.Extensive experiments involving 5 NLP tasks, 8 datasets and 26 NLP models show that LS can largely reduce the number of queries usually by an order of magnitude to achieve high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Topic Modeling
