MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination

Zhuo Li; Yupeng Zhang; Pengyu Cheng; Jiajun Song; Mengyu Zhou; Hao Li; Shujie Hu; Yu Qin; Erchao Zhao; Xiaoxi Jiang; Guanjun Jiang

arXiv:2603.24579·cs.CL·March 26, 2026

MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination

Zhuo Li, Yupeng Zhang, Pengyu Cheng, Jiajun Song, Mengyu Zhou, Hao Li, Shujie Hu, Yu Qin, Erchao Zhao, Xiaoxi Jiang, Guanjun Jiang

PDF

Open Access

TL;DR

MARCH introduces a multi-agent reinforcement learning framework with three specialized agents to reduce hallucinations in LLMs by enforcing factual consistency through deliberate information asymmetry, significantly improving reliability.

Contribution

The paper proposes a novel multi-agent reinforcement learning framework, MARCH, that leverages agent specialization and information asymmetry to effectively reduce hallucinations in LLMs.

Findings

01

MARCH significantly lowers hallucination rates across benchmarks.

02

An 8B-parameter LLM with MARCH matches performance of larger closed-source models.

03

The multi-agent approach enhances factual adherence through co-evolution.

Abstract

Hallucination remains a critical bottleneck for large language models (LLMs), undermining their reliability in real-world applications, especially in Retrieval-Augmented Generation (RAG) systems. While existing hallucination detection methods employ LLM-as-a-judge to verify LLM outputs against retrieved evidence, they suffer from inherent confirmation bias, where the verifier inadvertently reproduces the errors of the original generation. To address this, we introduce Multi-Agent Reinforced Self-Check for Hallucination (MARCH), a framework that enforces rigorous factual alignment by leveraging deliberate information asymmetry. MARCH orchestrates a collaborative pipeline of three specialized agents: a Solver, a Proposer, and a Checker. The Solver generates an initial RAG response, which the Proposer decomposes into claim-level verifiable atomic propositions. Crucially, the Checker…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications