Audited Skill-Graph Self-Improvement for Agentic LLMs via Verifiable Rewards, Experience Synthesis, and Continual Memory

Ken Huang; Jerry Huang

arXiv:2512.23760·cs.CR·January 1, 2026

Audited Skill-Graph Self-Improvement for Agentic LLMs via Verifiable Rewards, Experience Synthesis, and Continual Memory

Ken Huang, Jerry Huang

PDF

Open Access

TL;DR

This paper introduces ASG-SI, a framework for self-improving agentic language models that emphasizes verifiable, reusable skills, transparent reward decomposition, and continual memory management to enhance security, auditability, and reproducibility.

Contribution

It proposes a novel skill-graph based self-improvement system with verifiable reward decomposition, experience synthesis, and memory control, addressing security and transparency challenges in agentic LLMs.

Findings

01

Demonstrates verifier-backed reward construction and skill compilation.

02

Shows measurable improvements in performance under continual task streams.

03

Provides a complete, runnable implementation of the proposed system.

Abstract

Reinforcement learning is increasingly used to transform large language models into agentic systems that act over long horizons, invoke tools, and manage memory under partial observability. While recent work has demonstrated performance gains through tool learning, verifiable rewards, and continual training, deployed self-improving agents raise unresolved security and governance challenges: optimization pressure can incentivize reward hacking, behavioral drift is difficult to audit or reproduce, and improvements are often entangled in opaque parameter updates rather than reusable, verifiable artifacts. This paper proposes Audited Skill-Graph Self-Improvement (ASG-SI), a framework that treats self-improvement as iterative compilation of an agent into a growing, auditable skill graph. Each candidate improvement is extracted from successful trajectories, normalized into a skill with an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks