DynaTrust: Defending Multi-Agent Systems Against Sleeper Agents via Dynamic Trust Graphs

Yu Li; Qiang Hu; Yao Zhang; Lili Quan; Jiongchi Yu; Junjie Wang

arXiv:2603.15661·cs.AI·March 18, 2026

DynaTrust: Defending Multi-Agent Systems Against Sleeper Agents via Dynamic Trust Graphs

Yu Li, Qiang Hu, Yao Zhang, Lili Quan, Jiongchi Yu, Junjie Wang

PDF

Open Access

TL;DR

DynaTrust introduces a dynamic trust graph approach to defend multi-agent systems against sleeper agents by continuously updating trust and restructuring the graph, significantly improving detection success and reducing false positives.

Contribution

The paper presents DynaTrust, a novel dynamic trust graph method that adapts to evolving adversaries and maintains system utility, outperforming existing static defense techniques.

Findings

01

Increases defense success rate by 41.7%.

02

Achieves over 86% success rate under adversarial conditions.

03

Reduces false positive rate significantly.

Abstract

Large Language Model-based Multi-Agent Systems (MAS) have demonstrated remarkable collaborative reasoning capabilities but introduce new attack surfaces, such as the sleeper agent, which behave benignly during routine operation and gradually accumulate trust, only revealing malicious behaviors when specific conditions or triggers are met. Existing defense works primarily focus on static graph optimization or hierarchical data management, often failing to adapt to evolving adversarial strategies or suffering from high false-positive rates (FPR) due to rigid blocking policies. To address this, we propose DynaTrust, a novel defense method against sleeper agents. DynaTrust models MAS as a dynamic trust graph~(DTG), and treats trust as a continuous, evolving process rather than a static attribute. It dynamically updates the trust of each agent based on its historical behaviors and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)