A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered Applications are Vulnerable to PromptWares
Stav Cohen, Ron Bitton, Ben Nassi

TL;DR
This paper reveals that jailbroken GenAI models can be exploited to perform malicious activities, including denial-of-service and unauthorized data manipulation, posing significant security risks to AI-powered applications.
Contribution
It introduces PromptWare, a novel attack framework exploiting jailbreak techniques to manipulate GenAI applications, including advanced methods targeting unknown application logic.
Findings
PromptWare can force malicious execution flows in GenAI applications.
Advanced PromptWare Threat (APwT) can escalate privileges and manipulate data.
Attacks demonstrated on e-commerce chatbots to alter SQL data.
Abstract
In this paper we argue that a jailbroken GenAI model can cause substantial harm to GenAI-powered applications and facilitate PromptWare, a new type of attack that flips the GenAI model's behavior from serving an application to attacking it. PromptWare exploits user inputs to jailbreak a GenAI model to force/perform malicious activity within the context of a GenAI-powered application. First, we introduce a naive implementation of PromptWare that behaves as malware that targets Plan & Execute architectures (a.k.a., ReAct, function calling). We show that attackers could force a desired execution flow by creating a user input that produces desired outputs given that the logic of the GenAI-powered application is known to attackers. We demonstrate the application of a DoS attack that triggers the execution of a GenAI-powered assistant to enter an infinite loop that wastes money and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Healthcare Technology and Patient Monitoring
Methodstravel james
