Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection
Sarat Mudunuri, Jian Wan, Ally Qin, Srinivasan Manoharan

TL;DR
This paper introduces a vector-based semantic retrieval system for MCP tools that significantly reduces token overhead and improves tool relevance selection for LLMs, enhancing scalability and efficiency.
Contribution
It proposes a novel semantic indexing and dynamic selection framework for MCP tools, improving scalability and relevance in tool discovery for large language models.
Findings
99.6% reduction in token usage
97.1% hit rate at K=3
0.91 MRR on benchmark queries
Abstract
Large Language Models (LLMs) with tool-calling capabilities have demonstrated remarkable potential in executing complex tasks through external tool integration. The Model Context Protocol (MCP) has emerged as a standardized framework for connecting LLMs to diverse toolsets, with individual MCP servers potentially exposing dozens to hundreds of tools. However, current implementations face a critical scalability challenge: providing all available tools to the LLM context results in substantial token overhead, increased costs, reduced accuracy, and context window constraints. We present a semantic tool discovery architecture that addresses these challenges through vector-based retrieval. Our approach indexes MCP tools using dense embeddings that capture semantic relationships between tool capabilities and user intent, dynamically selecting only the most relevant tools (typically 3-5)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Big Data and Digital Economy
