ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

Zhuowen Yuan; Zhaorun Chen; Zhen Xiang; Nathaniel D. Bastian; Seyyed Hadi Hashemi; Chaowei Xiao; Wenbo Guo; Bo Li

arXiv:2604.04426·cs.AI·April 7, 2026

ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

Zhuowen Yuan, Zhaorun Chen, Zhen Xiang, Nathaniel D. Bastian, Seyyed Hadi Hashemi, Chaowei Xiao, Wenbo Guo, Bo Li

PDF

TL;DR

This paper introduces ShieldNet, a network-level guardrail system that detects supply-chain threats in agent systems by analyzing network interactions, significantly improving detection accuracy over existing methods.

Contribution

The paper presents ShieldNet, a novel network-based framework for detecting supply-chain attacks in agent systems, and introduces SC-Inject-Bench, a comprehensive benchmark for evaluating such threats.

Findings

01

ShieldNet achieves up to 0.995 F-1 score in detection.

02

ShieldNet introduces minimal runtime overhead.

03

Existing MCP scanners perform poorly on supply-chain threat detection.

Abstract

Existing research on LLM agent security mainly focuses on prompt injection and unsafe input/output behaviors. However, as agents increasingly rely on third-party tools and MCP servers, a new class of supply-chain threats has emerged, where malicious behaviors are embedded in seemingly benign tools, silently hijacking agent execution, leaking sensitive data, or triggering unauthorized actions. Despite their growing impact, there is currently no comprehensive benchmark for evaluating such threats. To bridge this gap, we introduce SC-Inject-Bench, a large-scale benchmark comprising over 10,000 malicious MCP tools grounded in a taxonomy of 25+ attack types derived from MITRE ATT&CK targeting supply-chain threats. We observe that existing MCP scanners and semantic guardrails perform poorly on this benchmark. Motivated by this finding, we propose ShieldNet, a network-level guardrail framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.