Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via   Emoji Sequences

Yangshijie Zhang

arXiv:2502.17392·cs.AI·February 25, 2025

Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences

Yangshijie Zhang

PDF

Open Access

TL;DR

This paper introduces Emoji-Attack, a novel adversarial attack method on NLP models that manipulates emojis to create subtle perturbations, revealing vulnerabilities with minimal perceptibility and high effectiveness.

Contribution

The paper presents a new emoji-based adversarial attack technique that is more efficient and less perceptible than previous text attack methods, expanding the scope of adversarial strategies in NLP.

Findings

01

Emoji-Attack achieves high success rates on various NLP models.

02

The method requires fewer queries than traditional attacks.

03

It maintains semantic similarity while perturbing emojis.

Abstract

Deep neural networks (DNNs) have achieved remarkable success in the field of natural language processing (NLP), leading to widely recognized applications such as ChatGPT. However, the vulnerability of these models to adversarial attacks remains a significant concern. Unlike continuous domains like images, text exists in a discrete space, making even minor alterations at the sentence, word, or character level easily perceptible to humans. This inherent discreteness also complicates the use of conventional optimization techniques, as text is non-differentiable. Previous research on adversarial attacks in text has focused on character-level, word-level, sentence-level, and multi-level approaches, all of which suffer from inefficiency or perceptibility issues due to the need for multiple queries or significant semantic shifts. In this work, we introduce a novel adversarial attack method,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning