Robust Multi-Agent LLMs under Byzantine Faults
Haejoon Lee, Vincent-Daniel Yun, Hyeonho Oh, Dimitra Panagou, Sai Praneeth Karimireddy

TL;DR
This paper introduces Self-Anchored Consensus (SAC), a decentralized protocol for multi-agent LLM systems that resists Byzantine faults and improves reliability without relying on leader-based methods.
Contribution
The paper proposes a novel decentralized filtering protocol, SAC, with robustness guarantees, enhancing multi-agent LLM systems against adversarial Byzantine agents.
Findings
SAC effectively suppresses Byzantine influence in experiments.
SAC improves performance across various communication topologies.
Prior methods degrade under adversarial conditions.
Abstract
Large language model (LLM) agents increasingly collaborate over peer-to-peer networks to improve their reliability. However, these same interactions can also become a source of vulnerability, as unreliable or Byzantine agents may sway neighboring agents toward incorrect conclusions and degrade overall system performance. Existing methods rely on leader-based coordination or self-reported confidence, both of which are susceptible to adversarial manipulation. We study decentralized LLM multi-agent systems (LLM-MAS) and propose Self-Anchored Consensus (SAC), a fully decentralized iterative filter-and-refine protocol in which agents iteratively exchange responses, locally evaluate and filter unreliable messages, and refine their own outputs. We present -robustness conditions for the communication graph that ensure honest agents preserve and propagate reliable information despite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
