Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation

Zhisheng Qi; Utkarsh Sahu; Li Ma; Haoyu Han; Ryan Rossi; Franck Dernoncourt; Mahantesh Halappanavar; Nesreen Ahmed; Yushun Dong; Yue Zhao; Yu Zhang; Yu Wang

arXiv:2602.09319·cs.CR·February 13, 2026

Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation

Zhisheng Qi, Utkarsh Sahu, Li Ma, Haoyu Han, Ryan Rossi, Franck Dernoncourt, Mahantesh Halappanavar, Nesreen Ahmed, Yushun Dong, Yue Zhao, Yu Zhang, Yu Wang

PDF

Open Access

TL;DR

This paper introduces the first comprehensive benchmark for evaluating knowledge-extraction attacks and defenses on Retrieval-Augmented Generation systems, standardizing evaluation protocols across diverse models and datasets.

Contribution

It provides a unified, reproducible framework for assessing attack and defense strategies on RAG systems, addressing the fragmented research landscape.

Findings

01

Benchmark covers multiple attack and defense strategies.

02

Standardized evaluation protocols across datasets.

03

Insights into the effectiveness of various defenses.

Abstract

Retrieval-Augmented Generation (RAG) has become a cornerstone of knowledge-intensive applications, including enterprise chatbots, healthcare assistants, and agentic memory management. However, recent studies show that knowledge-extraction attacks can recover sensitive knowledge-base content through maliciously crafted queries, raising serious concerns about intellectual property theft and privacy leakage. While prior work has explored individual attack and defense techniques, the research landscape remains fragmented, spanning heterogeneous retrieval embeddings, diverse generation models, and evaluations based on non-standardized metrics and inconsistent datasets. To address this gap, we introduce the first systematic benchmark for knowledge-extraction attacks on RAG systems. Our benchmark covers a broad spectrum of attack and defense strategies, representative retrieval embedding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Malware Detection Techniques · Spam and Phishing Detection