Audited Skill-Graph Self-Improvement for Agentic LLMs via Verifiable Rewards, Experience Synthesis, and Continual Memory
Ken Huang, Jerry Huang

TL;DR
This paper introduces ASG-SI, a framework for self-improving agentic language models that emphasizes verifiable, reusable skills, transparent reward decomposition, and continual memory management to enhance security, auditability, and reproducibility.
Contribution
It proposes a novel skill-graph based self-improvement system with verifiable reward decomposition, experience synthesis, and memory control, addressing security and transparency challenges in agentic LLMs.
Findings
Demonstrates verifier-backed reward construction and skill compilation.
Shows measurable improvements in performance under continual task streams.
Provides a complete, runnable implementation of the proposed system.
Abstract
Reinforcement learning is increasingly used to transform large language models into agentic systems that act over long horizons, invoke tools, and manage memory under partial observability. While recent work has demonstrated performance gains through tool learning, verifiable rewards, and continual training, deployed self-improving agents raise unresolved security and governance challenges: optimization pressure can incentivize reward hacking, behavioral drift is difficult to audit or reproduce, and improvements are often entangled in opaque parameter updates rather than reusable, verifiable artifacts. This paper proposes Audited Skill-Graph Self-Improvement (ASG-SI), a framework that treats self-improvement as iterative compilation of an agent into a growing, auditable skill graph. Each candidate improvement is extracted from successful trajectories, normalized into a skill with an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
