GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling

Jialong Zhou; Lichao Wang; Xiao Yang

arXiv:2505.19234·cs.AI·October 16, 2025

GUARDIAN: Safeguarding LLM Multi-Agent Collaborations with Temporal Graph Modeling

Jialong Zhou, Lichao Wang, Xiao Yang

PDF

Open Access

TL;DR

GUARDIAN is a novel framework that models multi-agent LLM interactions as temporal graphs to detect and mitigate safety issues like hallucinations and errors with high precision.

Contribution

It introduces a graph-based approach with an unsupervised encoder-decoder and information bottleneck to effectively identify safety concerns in multi-agent LLM collaborations.

Findings

01

Achieves state-of-the-art accuracy in safety detection.

02

Effectively models propagation of hallucinations and errors.

03

Demonstrates efficient resource utilization.

Abstract

The emergence of large language models (LLMs) enables the development of intelligent agents capable of engaging in complex and multi-turn dialogues. However, multi-agent collaboration faces critical safety challenges, such as hallucination amplification and error injection and propagation. This paper presents GUARDIAN, a unified method for detecting and mitigating multiple safety concerns in GUARDing Intelligent Agent collaboratioNs. By modeling the multi-agent collaboration process as a discrete-time temporal attributed graph, GUARDIAN explicitly captures the propagation dynamics of hallucinations and errors. The unsupervised encoder-decoder architecture incorporating an incremental training paradigm learns to reconstruct node attributes and graph structures from latent embeddings, enabling the identification of anomalous nodes and edges with unparalleled precision. Moreover, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Data Quality and Management · Data Mining Algorithms and Applications