ClawLess: A Security Model of AI Agents

Hongyi Lu; Nian Liu; Shuai Wang; Fengwei Zhang

arXiv:2604.06284·cs.CR·April 9, 2026

ClawLess: A Security Model of AI Agents

Hongyi Lu, Nian Liu, Shuai Wang, Fengwei Zhang

PDF

TL;DR

ClawLess introduces a formal security framework for AI agents that enforces verified policies to mitigate risks from potentially adversarial autonomous agents, bridging formal models with practical enforcement.

Contribution

It presents a novel security model and enforcement mechanism for AI agents, ensuring security even if the agent behaves adversarially, using a formal policy system and kernel-level enforcement.

Findings

01

Formal security policies can be dynamically expressed and enforced.

02

The system ensures security guarantees under worst-case adversarial scenarios.

03

Practical enforcement is achieved via a user-space kernel with syscall interception.

Abstract

Autonomous AI agents powered by Large Language Models can reason, plan, and execute complex tasks, but their ability to autonomously retrieve information and run code introduces significant security risks. Existing approaches attempt to regulate agent behavior through training or prompting, which does not offer fundamental security guarantees. We present ClawLess, a security framework that enforces formally verified policies on AI agents under a worst-case threat model where the agent itself may be adversarial. ClawLess formalizes a fine-grained security model over system entities, trust scopes, and permissions to express dynamic policies that adapt to agents' runtime behavior. These policies are translated into concrete security rules and enforced through a user-space kernel augmented with BPF-based syscall interception. This approach bridges the formal security model with practical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.