Human-Imperceptible Retrieval Poisoning Attacks in LLM-Powered   Applications

Quan Zhang; Binqi Zeng; Chijin Zhou; Gwihwan Go; Heyuan Shi; Yu Jiang

arXiv:2404.17196·cs.CR·April 29, 2024·2 cites

Human-Imperceptible Retrieval Poisoning Attacks in LLM-Powered Applications

Quan Zhang, Binqi Zeng, Chijin Zhou, Gwihwan Go, Heyuan Shi, Yu Jiang

PDF

Open Access

TL;DR

This paper identifies a new security threat called retrieval poisoning in LLM-powered applications, where attackers craft imperceptible malicious documents to mislead retrieval-augmented generation, with high success rates demonstrated in experiments.

Contribution

It introduces the concept of retrieval poisoning, analyzes its mechanism in LLM frameworks, and empirically demonstrates its high success rate and potential risks.

Findings

01

Attackers can craft visually indistinguishable malicious documents.

02

Retrieval poisoning achieves an 88.33% success rate in misleading LLMs.

03

66.67% success rate in real-world application scenarios.

Abstract

Presently, with the assistance of advanced LLM application development frameworks, more and more LLM-powered applications can effortlessly augment the LLMs' knowledge with external content using the retrieval augmented generation (RAG) technique. However, these frameworks' designs do not have sufficient consideration of the risk of external content, thereby allowing attackers to undermine the applications developed with these frameworks. In this paper, we reveal a new threat to LLM-powered applications, termed retrieval poisoning, where attackers can guide the application to yield malicious responses during the RAG process. Specifically, through the analysis of LLM application frameworks, attackers can craft documents visually indistinguishable from benign ones. Despite the documents providing correct information, once they are used as reference sources for RAG, the application is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT-based Smart Home Systems · Advanced Malware Detection Techniques · Advanced Neural Network Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Linear Layer · Dense Connections · Linear Warmup With Linear Decay · Weight Decay · Adam · Layer Normalization · Attention Dropout