Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems

Chris M. Ward; Josh Harguess

arXiv:2506.00281·cs.CR·June 3, 2025

Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems

Chris M. Ward, Josh Harguess

PDF

TL;DR

This paper identifies key adversarial threats to Retrieval-Augmented Generation systems, analyzes their risks, and proposes mitigation strategies like input validation and real-time monitoring to enhance system security.

Contribution

It provides a comprehensive analysis of adversarial attack vectors on RAG systems and introduces a prioritized control list for risk mitigation.

Findings

01

Prompt injection is a major threat.

02

Adversarial training improves robustness.

03

Real-time monitoring helps detect attacks.

Abstract

Retrieval-Augmented Generation (RAG) systems, which integrate Large Language Models (LLMs) with external knowledge sources, are vulnerable to a range of adversarial attack vectors. This paper examines the importance of RAG systems through recent industry adoption trends and identifies the prominent attack vectors for RAG: prompt injection, data poisoning, and adversarial query manipulation. We analyze these threats under risk management lens, and propose robust prioritized control list that includes risk-mitigating actions like input validation, adversarial training, and real-time monitoring.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Attention Dropout · Softmax · WordPiece · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · Multi-Head Attention · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout