Silent Egress: When Implicit Prompt Injection Makes LLM Agents Leak Without a Trace
Qianlong Lan, Anuj Kaul, Shaun Jones, Stephanie Westrum

TL;DR
This paper reveals a new security vulnerability called silent egress in agentic LLM systems, where malicious web content can covertly exfiltrate sensitive data through implicit prompt injection, bypassing common safety measures.
Contribution
It introduces the concept of silent egress, demonstrates its effectiveness through experiments, and evaluates defenses, emphasizing the importance of system and network-level protections over prompt hardening.
Findings
High success rate (89%) of silent egress attacks in experiments.
Most attacks (95%) bypass output-based safety checks.
System and network controls are more effective than prompt-level defenses.
Abstract
Agentic large language model systems increasingly automate tasks by retrieving URLs and calling external tools. We show that this workflow gives rise to implicit prompt injection: adversarial instructions embedded in automatically generated URL previews, including titles, metadata, and snippets, can introduce a system-level risk that we refer to as silent egress. Using a fully local and reproducible testbed, we demonstrate that a malicious web page can induce an agent to issue outbound requests that exfiltrate sensitive runtime context, even when the final response shown to the user appears harmless. In 480 experimental runs with a qwen2.5:7b-based agent, the attack succeeds with high probability (P (egress) =0.89), and 95% of successful attacks are not detected by output-based safety checks. We also introduce sharded exfiltration, where sensitive information is split across multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Spam and Phishing Detection · Topic Modeling
