EmoRAG: Evaluating RAG Robustness to Symbolic Perturbations
Xinyun Zhou, Xinfeng Li, Yinan Peng, Ming Xu, Xuanwang Zhang, Miao Yu, Yidong Wang, Xiaojun Jia, Kun Wang, Qingsong Wen, XiaoFeng Wang, and Wei Dong

TL;DR
This paper uncovers a critical vulnerability in Retrieval-Augmented Generation systems where subtle emoticon perturbations can drastically mislead retrieval, highlighting the need for more robust defenses.
Contribution
The study reveals the profound susceptibility of RAG systems to symbolic emoticon perturbations and evaluates defenses, emphasizing the importance of robustness in retrieval-based AI models.
Findings
Single emoticons nearly always dominate retrieval results.
Positional placement of emoticons greatly affects system robustness.
Larger models are more vulnerable to emoticon perturbations.
Abstract
Retrieval-Augmented Generation (RAG) systems are increasingly central to robust AI, enhancing large language model (LLM) faithfulness by incorporating external knowledge. However, our study unveils a critical, overlooked vulnerability: their profound susceptibility to subtle symbolic perturbations, particularly through near-imperceptible emoticon tokens such as "(@_@)" that can catastrophically mislead retrieval, termed EmoRAG. We demonstrate that injecting a single emoticon into a query makes it nearly 100% likely to retrieve semantically unrelated texts that contain a matching emoticon. Our extensive experiment across general question-answering and code domains, using a range of state-of-the-art retrievers and generators, reveals three key findings: (I) Single-Emoticon Disaster: Minimal emoticon injections cause maximal disruptions, with a single emoticon almost 100% dominating RAG…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Hate Speech and Cyberbullying Detection · Spam and Phishing Detection
