Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks against RAG-based Inference in Scale and Severity Using Jailbreaking
Stav Cohen, Ron Bitton, Ben Nassi

TL;DR
This paper demonstrates how jailbreaking GenAI models can escalate attacks on RAG-based systems, leading to data extraction and ecosystem-wide data poisoning, highlighting significant security vulnerabilities.
Contribution
It introduces methods to escalate RAG inference attacks and proposes a self-replicating worm model for large-scale data poisoning within GenAI ecosystems.
Findings
Attackers can extract up to 99.8% of database data.
The worm can propagate across applications, amplifying damage.
Guardrails have tradeoffs in protecting RAG systems.
Abstract
In this paper, we show that with the ability to jailbreak a GenAI model, attackers can escalate the outcome of attacks against RAG-based GenAI-powered applications in severity and scale. In the first part of the paper, we show that attackers can escalate RAG membership inference attacks and RAG entity extraction attacks to RAG documents extraction attacks, forcing a more severe outcome compared to existing attacks. We evaluate the results obtained from three extraction methods, the influence of the type and the size of five embeddings algorithms employed, the size of the provided context, and the GenAI engine. We show that attackers can extract 80%-99.8% of the data stored in the database used by the RAG of a Q&A chatbot. In the second part of the paper, we show that attackers can escalate the scale of RAG data poisoning attacks from compromising a single GenAI-powered application to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCybercrime and Law Enforcement Studies · Crime Patterns and Interventions · Deception detection and forensic psychology
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Softmax · Layer Normalization · Dropout · WordPiece · Residual Connection · Attention Dropout · Linear Layer
