We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems

Junfeng Fang; Zijun Yao; Ruipeng Wang; Haokai Ma; Xiang Wang; Tat-Seng Chua

arXiv:2506.13666·cs.LG·June 17, 2025

We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems

Junfeng Fang, Zijun Yao, Ruipeng Wang, Haokai Ma, Xiang Wang, Tat-Seng Chua

PDF

Open Access 1 Repo

TL;DR

This paper highlights the safety risks introduced by model context protocols (MCP) in large language model (LLM) agent systems, emphasizing the need for research on mitigation techniques to ensure safe deployment.

Contribution

The paper constructs a controlled framework to examine MCP safety issues, demonstrates real safety risks through pilot experiments, and proposes a research roadmap for safe MCP development.

Findings

01

Safety risks in MCP-powered systems are real and significant.

02

Defense against MCP safety threats is complex and non-trivial.

03

A roadmap for developing safe MCP-powered agent systems is proposed.

Abstract

The development of large language models (LLMs) has entered in a experience-driven era, flagged by the emergence of environment feedback-driven learning via reinforcement learning and tool-using agents. This encourages the emergenece of model context protocol (MCP), which defines the standard on how should a LLM interact with external services, such as \api and data. However, as MCP becomes the de facto standard for LLM agent systems, it also introduces new safety risks. In particular, MCP introduces third-party services, which are not controlled by the LLM developers, into the agent systems. These third-party MCP services provider are potentially malicious and have the economic incentives to exploit vulnerabilities and sabotage user-agent interactions. In this position paper, we advocate the research community in LLM safety to pay close attention to the new safety risks issues…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

littlelittlenine/safemcp
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Software Engineering Methodologies · Security and Verification in Computing · Distributed systems and fault tolerance