Saliency Attention and Semantic Similarity-Driven Adversarial Perturbation
Hetvi Waghela, Jaydip Sen, Sneha Rakshit

TL;DR
This paper presents SASSP, an advanced adversarial attack method for NLP that combines saliency, attention, and semantic similarity to generate effective, semantically consistent adversarial examples.
Contribution
The paper introduces SASSP, a novel adversarial attack approach that integrates saliency, attention, and semantic similarity to improve attack success and semantic preservation.
Findings
SASSP achieves higher attack success rates than previous methods.
SASSP maintains semantic fidelity in adversarial examples.
SASSP reduces the number of words perturbed while remaining effective.
Abstract
In this paper, we introduce an enhanced textual adversarial attack method, known as Saliency Attention and Semantic Similarity driven adversarial Perturbation (SASSP). The proposed scheme is designed to improve the effectiveness of contextual perturbations by integrating saliency, attention, and semantic similarity. Traditional adversarial attack methods often struggle to maintain semantic consistency and coherence while effectively deceiving target models. Our proposed approach addresses these challenges by incorporating a three-pronged strategy for word selection and perturbation. First, we utilize a saliency-based word selection to prioritize words for modification based on their importance to the model's prediction. Second, attention mechanisms are employed to focus perturbations on contextually significant words, enhancing the attack's efficacy. Finally, an advanced semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsSoftmax · Attention Is All You Need · Focus
