Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks

Yanming Mu; Hao Hu; Feiyang Li; Qiao Yuan; Jiang Wu; Zichuan Liu; Pengcheng Liu; Mei Wang; Hongwei Zhou; Yuling Liu

arXiv:2603.21654 · cs.CR · March 24, 2026

Open Access

TL;DR

This paper provides the first comprehensive survey of security threats, defenses, and benchmarks for Retrieval-Augmented Generation systems, aiming to enhance their robustness and trustworthiness.

Contribution

It systematically analyzes RAG vulnerabilities, categorizes defense strategies, and consolidates evaluation benchmarks, offering a unified end-to-end security assessment framework.

Findings

01. Identifies core threat vectors like data poisoning and adversarial attacks.

02. Summarizes defense mechanisms including encryption and differential privacy.

03. Establishes a unified benchmark for RAG security evaluation.

Abstract

Retrieval-Augmented Generation (RAG) significantly mitigates the hallucinations and domain knowledge deficiency in large language models by incorporating external knowledge bases. However, the multi-module architecture of RAG introduces complex system-level security vulnerabilities. Guided by the RAG workflow, this paper analyzes the underlying vulnerability mechanisms and systematically categorizes core threat vectors such as data poisoning, adversarial attacks, and membership inference attacks. Based on this threat assessment, we construct a taxonomy of RAG defense technologies from a dual perspective encompassing both input and output stages. The input-side analysis reviews data protection mechanisms including dynamic access control, homomorphic encryption retrieval, and adversarial pre-filtering. The output-side examination summarizes advanced leakage prevention techniques such as…

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Security and Verification in Computing · Advanced Malware Detection Techniques