EmoRAG: Evaluating RAG Robustness to Symbolic Perturbations

Xinyun Zhou; Xinfeng Li; Yinan Peng; Ming Xu; Xuanwang Zhang; Miao Yu; Yidong Wang; Xiaojun Jia; Kun Wang; Qingsong Wen; XiaoFeng Wang; and Wei Dong

arXiv:2512.01335·cs.CR·December 2, 2025

EmoRAG: Evaluating RAG Robustness to Symbolic Perturbations

Xinyun Zhou, Xinfeng Li, Yinan Peng, Ming Xu, Xuanwang Zhang, Miao Yu, Yidong Wang, Xiaojun Jia, Kun Wang, Qingsong Wen, XiaoFeng Wang, and Wei Dong

PDF

Open Access

TL;DR

This paper uncovers a critical vulnerability in Retrieval-Augmented Generation systems where subtle emoticon perturbations can drastically mislead retrieval, highlighting the need for more robust defenses.

Contribution

The study reveals the profound susceptibility of RAG systems to symbolic emoticon perturbations and evaluates defenses, emphasizing the importance of robustness in retrieval-based AI models.

Findings

01

Single emoticons nearly always dominate retrieval results.

02

Positional placement of emoticons greatly affects system robustness.

03

Larger models are more vulnerable to emoticon perturbations.

Abstract

Retrieval-Augmented Generation (RAG) systems are increasingly central to robust AI, enhancing large language model (LLM) faithfulness by incorporating external knowledge. However, our study unveils a critical, overlooked vulnerability: their profound susceptibility to subtle symbolic perturbations, particularly through near-imperceptible emoticon tokens such as "(@_@)" that can catastrophically mislead retrieval, termed EmoRAG. We demonstrate that injecting a single emoticon into a query makes it nearly 100% likely to retrieve semantically unrelated texts that contain a matching emoticon. Our extensive experiment across general question-answering and code domains, using a range of state-of-the-art retrievers and generators, reveals three key findings: (I) Single-Emoticon Disaster: Minimal emoticon injections cause maximal disruptions, with a single emoticon almost 100% dominating RAG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Hate Speech and Cyberbullying Detection · Spam and Phishing Detection