TL;DR
This study empirically evaluates the security and maintainability of 1,899 open-source MCP servers, revealing vulnerabilities and emphasizing the need for MCP-specific security measures and better ecosystem governance.
Contribution
First large-scale empirical analysis of MCP servers assessing health, security, and maintainability, highlighting unique vulnerabilities and proposing targeted detection and governance strategies.
Findings
Identified eight distinct vulnerabilities, three overlapping with traditional software issues.
7.2% of servers contain general vulnerabilities, 5.5% have MCP-specific tool poisoning.
66% of servers exhibit code smells, with 14.4% showing overlapping bug patterns.
Abstract
Although Foundation Models (FMs), such as GPT-4, are increasingly used in domains like finance and software engineering, reliance on textual interfaces limits these models' real-world interaction. To address this, FM providers introduced a tool called -- triggering a proliferation of frameworks with distinct tool interfaces. In late 2024, Anthropic introduced the Model Context Protocol (MCP) to standardize this tool ecosystem. MCP is rapidly emerging as a de facto industry standard. Despite its adoption, MCP's AI-driven, non-deterministic control flow introduces new risks to sustainability, security, and maintainability, warranting closer examination. Towards this end, we present the first large-scale empirical study of MCP. Using state-of-the-art health metrics and a hybrid analysis pipeline that combines a general-purpose static analysis tool with an MCP-specific scanner, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
