Trivial Trojans: How Minimal MCP Servers Enable Cross-Tool Exfiltration of Sensitive Data
Nicola Croce, Tobin South

TL;DR
This paper reveals that the Model Context Protocol (MCP) enables trivial cross-server data exfiltration attacks by unsophisticated actors, exposing significant security vulnerabilities in AI-tool integration ecosystems.
Contribution
It demonstrates that current MCP implementations are vulnerable to simple, low-skill attacks and proposes mitigations and protocol improvements to enhance security.
Findings
Attackers can exfiltrate sensitive data using minimal technical skills.
Trust relationships in MCP can be exploited for cross-server attacks.
Security gaps in MCP ecosystem pose significant risks for AI-tool integrations.
Abstract
The Model Context Protocol (MCP) represents a significant advancement in AI-tool integration, enabling seamless communication between AI agents and external services. However, this connectivity introduces novel attack vectors that remain largely unexplored. This paper demonstrates how unsophisticated threat actors, requiring only basic programming skills and free web tools, can exploit MCP's trust model to exfiltrate sensitive financial data. We present a proof-of-concept attack where a malicious weather MCP server, disguised as benign functionality, discovers and exploits legitimate banking tools to steal user account balances. The attack chain requires no advanced technical knowledge, server infrastructure, or monetary investment. The findings reveal a critical security gap in the emerging MCP ecosystem: while individual servers may appear trustworthy, their combination creates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
