Adversarial Text Generation with Dynamic Contextual Perturbation
Hetvi Waghela, Jaydip Sen, Sneha Rakshit, Subhasis Dasgupta

TL;DR
This paper introduces Dynamic Contextual Perturbation (DCP), a novel adversarial text attack method that dynamically generates context-aware perturbations to challenge NLP models while maintaining natural language semantics.
Contribution
The paper presents a new context-aware adversarial attack scheme that leverages pre-trained language models to produce more natural and effective adversarial examples across entire texts.
Findings
DCP outperforms existing methods in fooling NLP models.
DCP maintains high semantic fidelity and fluency.
Adversarial examples generated by DCP are more subtle and impactful.
Abstract
Adversarial attacks on Natural Language Processing (NLP) models expose vulnerabilities by introducing subtle perturbations to input text, often leading to misclassification while maintaining human readability. Existing methods typically focus on word-level or local text segment alterations, overlooking the broader context, which results in detectable or semantically inconsistent perturbations. We propose a novel adversarial text attack scheme named Dynamic Contextual Perturbation (DCP). DCP dynamically generates context-aware perturbations across sentences, paragraphs, and documents, ensuring semantic fidelity and fluency. Leveraging the capabilities of pre-trained language models, DCP iteratively refines perturbations through an adversarial objective function that balances the dual objectives of inducing model misclassification and preserving the naturalness of the text. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
