A Formal Security Framework for MCP-Based AI Agents: Threat Taxonomy, Verification Models, and Defense Mechanisms
Nirajan Acharya, Gaurav Kumar Gupta

TL;DR
This paper introduces MCPSHIELD, a formal security framework for MCP-based AI agents, including threat taxonomy, verification models, and a comprehensive defense architecture to address security gaps.
Contribution
It provides a hierarchical threat taxonomy, a formal verification model, evaluates existing defenses, and proposes an integrated defense architecture for MCP-based AI agents.
Findings
No single existing defense covers more than 34% of threats.
MCPSHIELD's architecture achieves 91% theoretical threat coverage.
Identifies seven open research challenges for secure agentic AI.
Abstract
The Model Context Protocol (MCP), introduced by Anthropic in November 2024 and now governed by the Linux Foundation's Agentic AI Foundation, has rapidly become the de facto standard for connecting large language model (LLM)-based agents to external tools and data sources, with over 97 million monthly SDK downloads and more than 177000 registered tools. However, this explosive adoption has exposed a critical gap: the absence of a unified, formal security framework capable of systematically characterizing, analyzing, and mitigating the diverse threats facing MCP-based agent ecosystems. Existing security research remains fragmented across individual attack papers, isolated benchmarks, and point defense mechanisms. This paper presents MCPSHIELD, a comprehensive formal security framework for MCP-based AI agents. We make four principal contributions: (1) a hierarchical threat taxonomy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
