A Systematic Survey of Security Threats and Defenses in LLM-Based AI Agents: A Layered Attack Surface Framework
Kexin Chu

TL;DR
This paper introduces the Layered Attack Surface Model ( extsc{lasm}), a comprehensive framework for analyzing security threats in agentic AI systems across multiple layers and timescales, highlighting under-explored areas and gaps.
Contribution
It proposes a novel 7-layer, 4-temporality framework for security analysis in agentic AI, along with a cross-layer defense taxonomy and reproducible analysis tools.
Findings
Upper layers of the agentic stack are under-explored for long-horizon threats.
Many attack regions lack corresponding defenses.
Current benchmarks do not cover cross-session or sub-session failures.
Abstract
Agentic AI systems introduce a security surface that is qualitatively different from that of stateless LLMs. They persist memory, invoke external tools, coordinate with peer agents, and operate across sessions, allowing attacks to emerge not only at the prompt interface but also through architectural state, delegated authority, and long-horizon interactions. Existing security taxonomies, however, primarily organize threats by attack type, such as prompt injection or jailbreaking, and therefore obscure where in the agentic stack a threat arises and over what timescale it manifests. We propose the Layered Attack Surface Model (\lasm), a structural taxonomy for agentic AI security. \lasm decomposes the agentic stack into seven layers -- Foundation, Cognitive, Memory, Tool Execution, Multi-Agent Coordination, Ecosystem, and Governance -- and augments them with a four-class temporality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
