Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection

Sarat Mudunuri; Jian Wan; Ally Qin; Srinivasan Manoharan

arXiv:2603.20313·cs.SE·March 24, 2026

Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection

Sarat Mudunuri, Jian Wan, Ally Qin, Srinivasan Manoharan

PDF

Open Access

TL;DR

This paper introduces a vector-based semantic retrieval system for MCP tools that significantly reduces token overhead and improves tool relevance selection for LLMs, enhancing scalability and efficiency.

Contribution

It proposes a novel semantic indexing and dynamic selection framework for MCP tools, improving scalability and relevance in tool discovery for large language models.

Findings

01

99.6% reduction in token usage

02

97.1% hit rate at K=3

03

0.91 MRR on benchmark queries

Abstract

Large Language Models (LLMs) with tool-calling capabilities have demonstrated remarkable potential in executing complex tasks through external tool integration. The Model Context Protocol (MCP) has emerged as a standardized framework for connecting LLMs to diverse toolsets, with individual MCP servers potentially exposing dozens to hundreds of tools. However, current implementations face a critical scalability challenge: providing all available tools to the LLM context results in substantial token overhead, increased costs, reduced accuracy, and context window constraints. We present a semantic tool discovery architecture that addresses these challenges through vector-based retrieval. Our approach indexes MCP tools using dense embeddings that capture semantic relationships between tool capabilities and user intent, dynamically selecting only the most relevant tools (typically 3-5)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Big Data and Digital Economy