Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution

Meysam Alizadeh; Zeynab Samei; Daria Stetsenko; Fabrizio Gilardi

arXiv:2506.01055·cs.CR·June 3, 2025

Simple Prompt Injection Attacks Can Leak Personal Data Observed by LLM Agents During Task Execution

Meysam Alizadeh, Zeynab Samei, Daria Stetsenko, Fabrizio Gilardi

PDF

Open Access

TL;DR

This paper investigates how prompt injection attacks can cause LLM-based agents to leak personal data during task execution, revealing vulnerabilities in agent security and safety measures.

Contribution

It introduces data flow-based prompt injection attacks in a banking scenario and evaluates their impact on agent utility and data leakage across multiple tasks.

Findings

01

LLMs experience a 15-50% utility drop under attack

02

Average attack success rate (ASR) is around 20% in initial tests

03

Some defenses can reduce ASR to zero

Abstract

Previous benchmarks on prompt injection in large language models (LLMs) have primarily focused on generic tasks and attacks, offering limited insights into more complex threats like data exfiltration. This paper examines how prompt injection can cause tool-calling agents to leak personal data observed during task execution. Using a fictitious banking agent, we develop data flow-based attacks and integrate them into AgentDojo, a recent benchmark for agentic security. To enhance its scope, we also create a richer synthetic dataset of human-AI banking conversations. In 16 user tasks from AgentDojo, LLMs show a 15-50 percentage point drop in utility under attack, with average attack success rates (ASR) around 20 percent; some defenses reduce ASR to zero. Most LLMs, even when successfully tricked by the attack, avoid leaking highly sensitive data like passwords, likely due to safety…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics · Security and Verification in Computing · Cloud Data Security Solutions