Chain Association-based Attacking and Shielding Natural Language   Processing Systems

Jiacheng Huang; Long Chen

arXiv:2411.07843·cs.CL·November 13, 2024

Chain Association-based Attacking and Shielding Natural Language Processing Systems

Jiacheng Huang, Long Chen

PDF

Open Access

TL;DR

This paper introduces a novel chain association-based adversarial attack on NLP systems, exploiting the comprehension gap between humans and machines, and proposes shielding methods to defend against such attacks.

Contribution

It presents a new attack method using chain association graphs and particle swarm optimization, and explores defense strategies like adversarial training and associative graph recovery.

Findings

01

NLP models are vulnerable to the proposed attack.

02

Humans can understand perturbed text better than models.

03

Shielding methods improve system robustness.

Abstract

Association as a gift enables people do not have to mention something in completely straightforward words and allows others to understand what they intend to refer to. In this paper, we propose a chain association-based adversarial attack against natural language processing systems, utilizing the comprehension gap between humans and machines. We first generate a chain association graph for Chinese characters based on the association paradigm for building search space of potential adversarial examples. Then, we introduce an discrete particle swarm optimization algorithm to search for the optimal adversarial examples. We conduct comprehensive experiments and show that advanced natural language processing models and applications, including large language models, are vulnerable to our attack, while humans appear good at understanding the perturbed text. We also explore two methods,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning