SearchSkill: Teaching LLMs to Use Search Tools with Evolving Skill Banks

Jinchao Hu; Meizhi Zhong; Kehai Chen; Min Zhang

arXiv:2605.09038·cs.AI·May 15, 2026

SearchSkill: Teaching LLMs to Use Search Tools with Evolving Skill Banks

Jinchao Hu, Meizhi Zhong, Kehai Chen, Min Zhang

PDF

TL;DR

SearchSkill introduces an evolving skill bank framework that enhances language models' ability to plan and execute search queries more effectively in open-domain question answering.

Contribution

It presents a novel two-stage training approach that explicitly models skill selection and grounding, improving retrieval efficiency and answer accuracy.

Findings

01

Improves exact match scores on knowledge-intensive QA benchmarks.

02

Reduces repeated and broad queries, focusing on atomic hops.

03

Achieves better retrieval behavior with fewer search steps.

Abstract

Teaching language models to use search tools is not only a question of whether they search, but also of whether they issue good queries. This is especially important in open-domain question answering, where broad or copied queries often waste retrieval budget and derail later reasoning. We propose \Ours, a framework that makes query planning explicit through reusable search skills. At each step, the model first selects a skill, then generates a search or answer action conditioned on the selected skill card. The skill inventory itself is not fixed: SearchSkill maintains an evolving SkillBank, expands or refines it from recurrent failure patterns, and reconstructs affected trajectories before supervised training. The resulting two-stage SFT recipe aligns training with the inference-time protocol of skill selection followed by skill-grounded execution. Across open-source and closed-source…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.