TL;DR
This paper introduces Ctx2Skill, a self-evolving framework enabling language models to autonomously discover and refine context-specific skills, enhancing their reasoning over complex, dense contexts without human supervision.
Contribution
It proposes a multi-agent self-play system with automated skill discovery and refinement, addressing manual annotation costs and lack of external feedback in context learning.
Findings
Improves solving rates on four CL-bench tasks across various models.
Automatically discovers and refines context-specific skills without human supervision.
Ensures robust skill evolution with a Cross-time Replay mechanism.
Abstract
Many real-world tasks require language models (LMs) to reason over complex contexts that exceed their parametric knowledge. This calls for context learning, where LMs directly learn relevant knowledge from the given context. An intuitive solution is inference-time skill augmentation: extracting the rules and procedures from context into natural-language skills. However, constructing such skills for context learning scenarios faces two challenges: the prohibitive cost of manual skill annotation for long, technically dense contexts, and the lack of external feedback for automated skill construction. In this paper, we propose Ctx2Skill, a self-evolving framework that autonomously discovers, refines, and selects context-specific skills without human supervision or external feedback. At its core, a multi-agent self-play loop has a Challenger that generates probing tasks and rubrics, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
