Feedback-Guided Extraction of Knowledge Base from Retrieval-Augmented LLM Applications
Changyue Jiang, Xudong Pan, Geng Hong, Chenfu Bao, Yang Chen, Min Yang

TL;DR
This paper introduces CopyBreakRAG, a feedback-guided black-box attack that significantly improves the extraction of knowledge bases from retrieval-augmented LLM applications, highlighting security vulnerabilities.
Contribution
It presents a novel agent-based attack method that adaptively generates adversarial queries, surpassing prior approaches in extraction coverage under realistic black-box conditions.
Findings
Achieves 45% higher extraction coverage than previous methods.
Extracts over 70% of knowledge base data in commercial RAG applications.
Outperforms state-of-the-art black-box extraction techniques.
Abstract
Retrieval-Augmented Generation (RAG) expands the knowledge boundary of large language models (LLMs) by integrating external knowledge bases, whose construction is often time-consuming and laborious. If an adversary extracts the knowledge base verbatim, it not only severely infringes the owner's intellectual property but also enables the adversary to replicate the application's functionality for unfair competition. Previous works on knowledge base extraction are limited either by low extraction coverage (usually less than 4%) in query-based attacks or by impractical assumptions of white-box access in embedding-based optimization methods. In this work, we propose CopyBreakRAG, an agent-based black-box attack that reasons from feedback and adaptively generates new adversarial queries for progressive extraction. By balancing exploration and exploitation through curiosity-driven queries and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlockchain Technology Applications and Security · Auction Theory and Applications · Privacy-Preserving Technologies in Data
