Don't believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions

Zhihao Li; Boyang Ma; Xuelong Dai; Minghui Xu; Yue Zhang; Biwei Yan; Kun Li

arXiv:2602.03580·cs.CR·February 4, 2026

Don't believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions

Zhihao Li, Boyang Ma, Xuelong Dai, Minghui Xu, Yue Zhang, Biwei Yan, Kun Li

PDF

Open Access

TL;DR

This paper investigates the security risks caused by inconsistencies between tool descriptions and implementations in MCP, revealing that about 13% of servers have significant mismatches that could enable malicious actions.

Contribution

It presents the first large-scale analysis of description-code mismatches in MCP, including an automated static analysis framework applied to over 10,000 servers, highlighting the prevalence and risks of such inconsistencies.

Findings

01

Approximately 13% of MCP servers exhibit substantial mismatches.

02

Mismatches can enable undocumented privileged operations or hidden state changes.

03

Systematic differences exist across application categories and marketplaces.

Abstract

The Model Context Protocol (MCP) enables large language models to invoke external tools through natural-language descriptions, forming the foundation of many AI agent applications. However, MCP does not enforce consistency between documented tool behavior and actual code execution, even though MCP Servers often run with broad system privileges. This gap introduces a largely unexplored security risk. We study how mismatches between externally presented tool descriptions and underlying implementations systematically shape the mental models and decision-making behavior of intelligent agents. Specifically, we present the first large-scale study of description-code inconsistency in the MCP ecosystem. We design an automated static analysis framework and apply it to 10,240 real-world MCP Servers across 36 categories. Our results show that while most servers are highly consistent, approximately…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)