IAE: Irony-based Adversarial Examples for Sentiment Analysis Systems
Xiaoyin Yi, Jiacheng Huang

TL;DR
This paper introduces Irony-based Adversarial Examples (IAE) that transform straightforward sentences into ironic ones to attack sentiment analysis systems, revealing their vulnerability to irony-based adversarial manipulation.
Contribution
We propose a novel irony-based method for generating adversarial text without relying on existing irony corpora, highlighting system vulnerabilities.
Findings
State-of-the-art models' performance drops significantly under IAE attacks.
Humans are less affected by irony in text compared to NLP systems.
IAE effectively exploits irony to create challenging adversarial examples.
Abstract
Adversarial examples, which are inputs deliberately perturbed with imperceptible changes to induce model errors, have raised serious concerns for the reliability and security of deep neural networks (DNNs). While adversarial attacks have been extensively studied in continuous data domains such as images, the discrete nature of text presents unique challenges. In this paper, we propose Irony-based Adversarial Examples (IAE), a method that transforms straightforward sentences into ironic ones to create adversarial text. This approach exploits the rhetorical device of irony, where the intended meaning is opposite to the literal interpretation, requiring a deeper understanding of context to detect. The IAE method is particularly challenging due to the need to accurately locate evaluation words, substitute them with appropriate collocations, and expand the text with suitable ironic elements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
