Security Attack and Defense Strategies for Autonomous Agent Frameworks: A Layered Review with OpenClaw as a Case Study
Luyao Xu, Xiang Chen

TL;DR
This paper provides a layered review of security risks and defense strategies in autonomous agent frameworks using OpenClaw as a case study, highlighting cross-layer threats and future challenges.
Contribution
It offers a systematic layered analysis of security issues in autonomous agents, filling a gap in existing scattered studies and proposing future research directions.
Findings
Identifies security risks at four key layers of autonomous agent frameworks.
Shows threats can propagate across layers, affecting system integrity.
Highlights challenges like research imbalance and ecosystem trust issues.
Abstract
Autonomous agent frameworks built upon large language models (LLMs) are evolving into complex, tool-integrated, and continuously operating systems, introducing security risks beyond traditional prompt-level vulnerabilities. As this paradigm is still at an early stage of development, a timely and systematic understanding of its security implications is increasingly important. Although a growing body of work has examined different attack surfaces and defense problems in agent systems, existing studies remain scattered across individual aspects of agent security, and there is still a lack of a layered review on this topic. To address this gap, this survey presents a layered review of security risks and defense strategies in autonomous agent frameworks, with OpenClaw as a case study. We organize the analysis into four security-relevant layers: the context and instruction layer, the tool and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
