Chain Association-based Attacking and Shielding Natural Language Processing Systems
Jiacheng Huang, Long Chen

TL;DR
This paper introduces a novel chain association-based adversarial attack on NLP systems, exploiting the comprehension gap between humans and machines, and proposes shielding methods to defend against such attacks.
Contribution
It presents a new attack method using chain association graphs and particle swarm optimization, and explores defense strategies like adversarial training and associative graph recovery.
Findings
NLP models are vulnerable to the proposed attack.
Humans can understand perturbed text better than models.
Shielding methods improve system robustness.
Abstract
Association as a gift enables people do not have to mention something in completely straightforward words and allows others to understand what they intend to refer to. In this paper, we propose a chain association-based adversarial attack against natural language processing systems, utilizing the comprehension gap between humans and machines. We first generate a chain association graph for Chinese characters based on the association paradigm for building search space of potential adversarial examples. Then, we introduce an discrete particle swarm optimization algorithm to search for the optimal adversarial examples. We conduct comprehensive experiments and show that advanced natural language processing models and applications, including large language models, are vulnerable to our attack, while humans appear good at understanding the perturbed text. We also explore two methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
