Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections
Xianglin Yang, Yufei He, Shuo Ji, Bryan Hooi, Jin Song Dong

TL;DR
This paper investigates security vulnerabilities in self-evolving LLM agents, demonstrating how malicious payloads can persist across sessions and cause unauthorized actions, highlighting the need for improved defenses.
Contribution
It formalizes the Zombie Agent attack, introduces a black-box attack framework, and proposes persistence strategies to defend against long-term memory injections in LLM agents.
Findings
Memory injection can persist over multiple sessions.
Current defenses are insufficient against persistent attacks.
Proposed strategies improve resilience of memory systems.
Abstract
Self-evolving LLM agents update their internal state across sessions, often by writing and reusing long-term memory. This design improves performance on long-horizon tasks but creates a security risk: untrusted external content observed during a benign session can be stored as memory and later treated as instruction. We study this risk and formalize a persistent attack we call a Zombie Agent, where an attacker covertly implants a payload that survives across sessions, effectively turning the agent into a puppet of the attacker. We present a black-box attack framework that uses only indirect exposure through attacker-controlled web content. The attack has two phases. During infection, the agent reads a poisoned source while completing a benign task and writes the payload into long-term memory through its normal update process. During trigger, the payload is retrieved or carried forward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity and Verification in Computing · Advanced Software Engineering Methodologies · Advanced Malware Detection Techniques
