SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Zhengxi Lu; Zhiyuan Yao; Jinyang Wu; Chengcheng Han; Qi Gu; Xunliang Cai; Weiming Lu; Jun Xiao; Yueting Zhuang; Yongliang Shen

arXiv:2604.02268·cs.LG·May 18, 2026

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Zhengxi Lu, Zhiyuan Yao, Jinyang Wu, Chengcheng Han, Qi Gu, Xunliang Cai, Weiming Lu, Jun Xiao, Yueting Zhuang, Yongliang Shen

PDF

1 Repo

TL;DR

SKILL0 is a reinforcement learning framework that internalizes agent skills into model parameters, enabling zero-shot autonomous behavior with minimal context, improving efficiency and performance over traditional retrieval-based methods.

Contribution

The paper introduces SKILL0, a novel in-context reinforcement learning method for skill internalization, reducing reliance on runtime skill retrieval and enhancing agent autonomy.

Findings

01

SKILL0 improves task success rates significantly over RL baselines (+9.7%, +6.6%, +10.1%).

02

It maintains a compact context of fewer than 0.5k tokens per step.

03

Extensive experiments validate the effectiveness of skill internalization.

Abstract

Agent skills, structured packages of procedural knowledge and executable resources that agents dynamically load at inference time, have become a reliable mechanism for augmenting LLM agents. Yet inference-time skill augmentation is fundamentally limited: retrieval noise introduces irrelevant guidance, injected skill content imposes substantial token overhead, and the model never truly acquires the knowledge it merely follows. We ask whether skills can instead be internalized into model parameters, enabling zero-shot autonomous behavior without any runtime skill retrieval. We introduce SKILL0, an in-context reinforcement learning framework designed for skill internalization. SKILL0 introduces a training-time curriculum that begins with full skill context and progressively withdraws it. Skills are grouped offline by category and rendered with interaction history into a compact visual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZJU-REAL/SkillZero
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.