Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents
Jiaqi Li, Yang Zhao, Bin Sun, Yang Yu, Jian Chang, Lidong Zhai

TL;DR
ClawdGo introduces a novel endogenous security training framework for autonomous AI agents, enabling threat recognition and reasoning at inference time without model modification.
Contribution
It presents four innovative components—TLDT, ASAT, CSMA, and SACP—that collectively enhance security awareness training for autonomous AI agents.
Findings
Weakest-first ASAT improves TLDT score from 80.9 to 96.9.
CSMA maintains full skill gains across sessions.
E-mode covers all 12 threat dimensions in scenarios.
Abstract
Autonomous AI agents deployed on platforms such as OpenClaw face prompt injection, memory poisoning, supply-chain attacks, and social engineering, yet existing defences address only the platform perimeter, leaving the agent's own threat judgement entirely untrained. We present ClawdGo, a framework for endogenous security awareness training: we teach the agent to recognise and reason about threats from the inside, at inference time, with no model modification. Four contributions are introduced: TLDT (Three-Layer Domain Taxonomy) organises 12 trainable dimensions across Self-Defence, Owner-Protection, and Enterprise-Security layers; ASAT (Autonomous Security Awareness Training) is a self-play loop where the agent alternates attacker, defender, and evaluator roles under weakest-first curriculum scheduling; CSMA (Cross-Session Memory Accumulation) compounds skill gains via a four-layer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
