Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents

Jiaqi Li; Yang Zhao; Bin Sun; Yang Yu; Jian Chang; Lidong Zhai

arXiv:2604.24020·cs.CR·April 28, 2026

Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents

Jiaqi Li, Yang Zhao, Bin Sun, Yang Yu, Jian Chang, Lidong Zhai

PDF

TL;DR

ClawdGo introduces a novel endogenous security training framework for autonomous AI agents, enabling threat recognition and reasoning at inference time without model modification.

Contribution

It presents four innovative components—TLDT, ASAT, CSMA, and SACP—that collectively enhance security awareness training for autonomous AI agents.

Findings

01

Weakest-first ASAT improves TLDT score from 80.9 to 96.9.

02

CSMA maintains full skill gains across sessions.

03

E-mode covers all 12 threat dimensions in scenarios.

Abstract

Autonomous AI agents deployed on platforms such as OpenClaw face prompt injection, memory poisoning, supply-chain attacks, and social engineering, yet existing defences address only the platform perimeter, leaving the agent's own threat judgement entirely untrained. We present ClawdGo, a framework for endogenous security awareness training: we teach the agent to recognise and reason about threats from the inside, at inference time, with no model modification. Four contributions are introduced: TLDT (Three-Layer Domain Taxonomy) organises 12 trainable dimensions across Self-Defence, Owner-Protection, and Enterprise-Security layers; ASAT (Autonomous Security Awareness Training) is a self-play loop where the agent alternates attacker, defender, and evaluator roles under weakest-first curriculum scheduling; CSMA (Cross-Session Memory Accumulation) compounds skill gains via a four-layer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.