Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions

Yuming Xu; Mingtao Zhang; Zhuohan Ge; Haoyang Li; Nicole Hu; Jason Chen Zhang; Qing Li; Lei Chen

arXiv:2604.08304·cs.CR·April 10, 2026

Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions

Yuming Xu, Mingtao Zhang, Zhuohan Ge, Haoyang Li, Nicole Hu, Jason Chen Zhang, Qing Li, Lei Chen

PDF

TL;DR

This paper provides a comprehensive taxonomy of security threats, defenses, and future research directions for retrieval-augmented generation (RAG), emphasizing the importance of securing the external knowledge-access pipeline.

Contribution

It introduces a new perspective on RAG security by defining operational boundaries and organizing vulnerabilities and defenses across the RAG workflow stages.

Findings

01

Current defenses are largely reactive and fragmented.

02

The paper categorizes threats into four primary security surfaces.

03

It proposes future directions for layered, boundary-aware protection.

Abstract

Retrieval-augmented generation (RAG) significantly enhances large language models (LLMs) but introduces novel security risks through external knowledge access. While existing studies cover various RAG vulnerabilities, they often conflate inherent LLM risks with those specifically introduced by RAG. In this paper, we propose that secure RAG is fundamentally about the security of the external knowledge-access pipeline. We establish an operational boundary to separate inherent LLM flaws from RAG-introduced or RAG-amplified threats. Guided by this perspective, we abstract the RAG workflow into six stages and organize the literature around three trust boundaries and four primary security surfaces, including pre-retrieval knowledge corruption, retrieval-time access manipulation, downstream context exploitation, and knowledge exfiltration. By systematically reviewing the corresponding attacks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.