The Hidden Threat in Plain Text: Attacking RAG Data Loaders

Alberto Castagnaro; Umberto Salviati; Mauro Conti; Luca Pajola; Simeone Pizzi

arXiv:2507.05093·cs.CR·July 8, 2025

The Hidden Threat in Plain Text: Attacking RAG Data Loaders

Alberto Castagnaro, Umberto Salviati, Mauro Conti, Luca Pajola, Simeone Pizzi

PDF

TL;DR

This paper reveals security vulnerabilities in RAG data loaders where malicious document manipulations can stealthily corrupt LLM outputs, demonstrating high attack success rates and emphasizing the need for improved defenses.

Contribution

It introduces a taxonomy of knowledge-based poisoning attacks, proposes two novel threat vectors, and provides an automated toolkit to evaluate vulnerabilities in RAG data loaders.

Findings

01

74.4% attack success rate across 357 scenarios

02

High success rates on six RAG systems including black-box services

03

Critical vulnerabilities bypassing filters and compromising output integrity

Abstract

Large Language Models (LLMs) have transformed human-machine interaction since ChatGPT's 2022 debut, with Retrieval-Augmented Generation (RAG) emerging as a key framework that enhances LLM outputs by integrating external knowledge. However, RAG's reliance on ingesting external documents introduces new vulnerabilities. This paper exposes a critical security gap at the data loading stage, where malicious actors can stealthily corrupt RAG pipelines by exploiting document ingestion. We propose a taxonomy of 9 knowledge-based poisoning attacks and introduce two novel threat vectors -- Content Obfuscation and Content Injection -- targeting common formats (DOCX, HTML, PDF). Using an automated toolkit implementing 19 stealthy injection techniques, we test five popular data loaders, finding a 74.4% attack success rate across 357 scenarios. We further validate these threats on six end-to-end RAG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.