TL;DR
This paper introduces IKEA, a novel attack method that exploits RAG systems through benign queries, revealing significant privacy and copyright risks by effectively extracting internal knowledge without detection.
Contribution
IKEA is the first implicit knowledge extraction attack on RAG systems using natural queries, combining anchor concepts with innovative mechanisms to improve stealth and efficiency.
Findings
IKEA surpasses baseline methods by over 80% in extraction efficiency.
IKEA achieves over 90% success rate in knowledge extraction.
Substitute RAG built from IKEA's data performs comparably to original systems.
Abstract
Retrieval-Augmented Generation (RAG) systems enhance large language models (LLMs) by incorporating external knowledge bases, but this may expose them to extraction attacks, leading to potential copyright and privacy risks. However, existing extraction methods typically rely on malicious inputs such as prompt injection or jailbreaking, making them easily detectable via input- or output-level detection. In this paper, we introduce Implicit Knowledge Extraction Attack (IKEA), which conducts Knowledge Extraction on RAG systems through benign queries. Specifically, IKEA first leverages anchor concepts-keywords related to internal knowledge-to generate queries with a natural appearance, and then designs two mechanisms that lead anchor concepts to thoroughly "explore" the RAG's knowledge: (1) Experience Reflection Sampling, which samples anchor concepts based on past query-response histories,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Softmax · Attention Dropout · WordPiece · Linear Layer · Residual Connection · Byte Pair Encoding · Weight Decay
