From Context to Skills: Can Language Models Learn from Context Skillfully?

Shuzheng Si; Haozhe Zhao; Yu Lei; Qingyi Wang; Dingwei Chen; Zhitong Wang; Zhenhailong Wang; Kangyang Luo; Zheng Wang; Gang Chen; Fanchao Qi; Minjia Zhang; Maosong Sun

arXiv:2604.27660·cs.AI·May 5, 2026

From Context to Skills: Can Language Models Learn from Context Skillfully?

Shuzheng Si, Haozhe Zhao, Yu Lei, Qingyi Wang, Dingwei Chen, Zhitong Wang, Zhenhailong Wang, Kangyang Luo, Zheng Wang, Gang Chen, Fanchao Qi, Minjia Zhang, Maosong Sun

PDF

1 Repo

TL;DR

This paper introduces Ctx2Skill, a self-evolving framework enabling language models to autonomously discover and refine context-specific skills, enhancing their reasoning over complex, dense contexts without human supervision.

Contribution

It proposes a multi-agent self-play system with automated skill discovery and refinement, addressing manual annotation costs and lack of external feedback in context learning.

Findings

01

Improves solving rates on four CL-bench tasks across various models.

02

Automatically discovers and refines context-specific skills without human supervision.

03

Ensures robust skill evolution with a Cross-time Replay mechanism.

Abstract

Many real-world tasks require language models (LMs) to reason over complex contexts that exceed their parametric knowledge. This calls for context learning, where LMs directly learn relevant knowledge from the given context. An intuitive solution is inference-time skill augmentation: extracting the rules and procedures from context into natural-language skills. However, constructing such skills for context learning scenarios faces two challenges: the prohibitive cost of manual skill annotation for long, technically dense contexts, and the lack of external feedback for automated skill construction. In this paper, we propose Ctx2Skill, a self-evolving framework that autonomously discovers, refines, and selects context-specific skills without human supervision or external feedback. At its core, a multi-agent self-play loop has a Challenger that generates probing tasks and rubrics, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

s1s-z/Ctx2Skill
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.