Graph-theoretic Agreement Framework for Multi-agent LLM Systems
Muhammad Umar Javed

TL;DR
This paper introduces a graph-theoretic framework to analyze and improve consensus stability in multi-agent large language model systems, addressing issues of unobservable states and adversarial critique cycles.
Contribution
It establishes a formal mapping between LLM reasoning and graph theory, providing new theorems and algorithms for ensuring stable consensus in distributed LLM architectures.
Findings
Unbalanced critique cycles cause reasoning oscillations.
Unobservable states act as topological Trojan horses destabilizing consensus.
Chordal graph restrictions and spectral shifts improve stability.
Abstract
The shift from monolithic LLMs to distributed multi-agent architectures demands new frameworks for verifying and securing autonomous coordination. Unlike traditional multi-agent systems focused on cooperative state alignment, modern LLM patterns: multi-agent debate, constitutional oversight, helper-critic loops-rely on adversarial critique for error correction and reasoning refinement. Since LLMs are dynamical systems whose latent states are imperfectly observable from verbalized outputs, securing these networks requires understanding both macroscopic topology and microscopic agent observability. This paper establishes a rigorous graph-theoretic framework for analyzing consensus in signed, directed interaction networks, bridging graph theory and LLM reasoning by formally mapping Transformer cross-entropy log-odds to the signed Laplacian. We characterize agreement stability through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Formal Methods in Verification · Logic, Reasoning, and Knowledge
