Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks   against RAG-based Inference in Scale and Severity Using Jailbreaking

Stav Cohen; Ron Bitton; Ben Nassi

arXiv:2409.08045·cs.CR·January 31, 2025·2 cites

Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks against RAG-based Inference in Scale and Severity Using Jailbreaking

Stav Cohen, Ron Bitton, Ben Nassi

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates how jailbreaking GenAI models can escalate attacks on RAG-based systems, leading to data extraction and ecosystem-wide data poisoning, highlighting significant security vulnerabilities.

Contribution

It introduces methods to escalate RAG inference attacks and proposes a self-replicating worm model for large-scale data poisoning within GenAI ecosystems.

Findings

01

Attackers can extract up to 99.8% of database data.

02

The worm can propagate across applications, amplifying damage.

03

Guardrails have tradeoffs in protecting RAG systems.

Abstract

In this paper, we show that with the ability to jailbreak a GenAI model, attackers can escalate the outcome of attacks against RAG-based GenAI-powered applications in severity and scale. In the first part of the paper, we show that attackers can escalate RAG membership inference attacks and RAG entity extraction attacks to RAG documents extraction attacks, forcing a more severe outcome compared to existing attacks. We evaluate the results obtained from three extraction methods, the influence of the type and the size of five embeddings algorithms employed, the size of the provided context, and the GenAI engine. We show that attackers can extract 80%-99.8% of the data stored in the database used by the RAG of a Q&A chatbot. In the second part of the paper, we show that attackers can escalate the scale of RAG data poisoning attacks from compromising a single GenAI-powered application to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stavc/unleashingworms-extractingdata
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCybercrime and Law Enforcement Studies · Crime Patterns and Interventions · Deception detection and forensic psychology

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Softmax · Layer Normalization · Dropout · WordPiece · Residual Connection · Attention Dropout · Linear Layer