Skill-Pro: Learning Reusable Skills from Experience via Non-Parametric PPO for LLM Agents

Qirui Mi; Zhijian Ma; Mengyue Yang; Haoxuan Li; Yisen Wang; Haifeng Zhang; Jun Wang

arXiv:2602.01869 · cs.AI · April 17, 2026

TL;DR

Skill-Pro enables LLM agents to autonomously learn and reuse procedural skills from experience, improving efficiency and stability without parameter updates through a novel non-parametric reinforcement learning approach.

Contribution

The paper introduces Skill-Pro, a framework for autonomous skill learning and reuse in LLM agents using a non-parametric PPO method without parameter updates.

Findings

01 Skill-Pro achieves higher skill reuse rates across tasks and agents.

02 It significantly improves performance with extreme memory compression.

03 Visualizations show transparent accumulation and refinement of skills.

Abstract

LLM-driven agents demonstrate strong performance in sequential decision-making but often rely on on-the-fly reasoning, re-deriving solutions even in recurring scenarios. This insufficient experience reuse leads to computational redundancy and execution instability. To bridge this gap, we propose Skill-Pro, a framework that enables agents to autonomously learn reusable procedural skills from interaction experiences without parameter updates. By formalizing a Skill-MDP, Skill-Pro transforms passive episodic narratives into executable Skills defined by activation, execution, and termination conditions to ensure executability. To achieve reliable reusability without capability degradation, we introduce Non-Parametric PPO, which leverages semantic gradients for high-quality candidate generation and a PPO Gate for robust Skill verification. Through score-based maintenance, Skill-Pro sustains…
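
The abstract describes a Skill as a unit with activation, execution, and termination conditions, kept or discarded through score-based maintenance. The sketch below is only an illustration of that idea under stated assumptions, not the authors' implementation: the names `Skill`, `SkillLibrary`, `select`, `update`, and `min_score`, the additive score update, and the pruning threshold are all hypothetical, and the paper's Non-Parametric PPO and PPO Gate are not modeled here.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical state type: a plain dict of observations.
State = Dict[str, object]


@dataclass
class Skill:
    """Illustrative skill record with the three conditions named in the abstract."""
    name: str
    activation: Callable[[State], bool]   # when the skill may start
    execution: Callable[[State], str]     # which action to emit while active
    termination: Callable[[State], bool]  # when the skill should stop
    score: float = 0.0                    # assumed running usefulness score


class SkillLibrary:
    """Assumed score-based maintenance: reward-updated scores, threshold pruning."""

    def __init__(self, min_score: float = -1.0) -> None:
        self.skills: List[Skill] = []
        self.min_score = min_score

    def add(self, skill: Skill) -> None:
        self.skills.append(skill)

    def select(self, state: State) -> Optional[Skill]:
        # Among skills whose activation condition holds, pick the best scored.
        candidates = [s for s in self.skills if s.activation(state)]
        return max(candidates, key=lambda s: s.score, default=None)

    def update(self, skill: Skill, reward: float) -> None:
        # Accumulate reward, then drop skills that fell below the threshold.
        skill.score += reward
        self.skills = [s for s in self.skills if s.score >= self.min_score]
```

In this sketch, an agent would call `select` on each new state, run the chosen skill's `execution` until its `termination` condition holds, and then call `update` with the observed reward. When `select` returns `None`, the agent falls back to on-the-fly reasoning.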

Code & Models

Datasets

molmohsen/awesome-ai-agent-papers
dataset · 39 downloads