Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models
Yuxiang Zhang, Xin Fan, Junjie Wang, Chongxian Chen, Fan Mo, Tetsuya, Sakai, Hayato Yamana

TL;DR
This paper introduces a new framework leveraging reinforcement learning to improve query-tool alignment in large language models, especially in low-resource scenarios, by framing tool retrieval as a massive retrieval task and demonstrating significant performance gains.
Contribution
The paper proposes a novel query-tool alignment framework using reinforcement learning and a new benchmark for massive tool retrieval, addressing input length constraints in LLMs.
Findings
Outperforms state-of-the-art models in top-5 and top-10 retrieval tasks.
Achieves up to 93.28% improvement in Sufficiency@k metric.
Demonstrates strong cross-dataset generalizability.
Abstract
Recent advancements in large language models (LLMs) integrated with external tools and APIs have successfully addressed complex tasks by using in-context learning or fine-tuning. Despite this progress, the vast scale of tool retrieval remains challenging due to stringent input length constraints. In response, we propose a pre-retrieval strategy from an extensive repository, effectively framing the problem as the massive tool retrieval (MTR) task. We introduce the MTRB (massive tool retrieval benchmark) to evaluate real-world tool-augmented LLM scenarios with a large number of tools. This benchmark is designed for low-resource scenarios and includes a diverse collection of tools with descriptions refined for consistency and clarity. It consists of three subsets, each containing 90 test samples and 10 training samples. To handle the low-resource MTR task, we raise a new query-tool…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Web Data Mining and Analysis · Data Mining Algorithms and Applications
