Teaming LLMs to Detect and Mitigate Hallucinations

Demian Till; John Smeaton; Peter Haubrick; Gouse Saheb; Florian Graef; David Berman

arXiv:2510.19507·cs.LG·October 24, 2025

Teaming LLMs to Detect and Mitigate Hallucinations

Demian Till, John Smeaton, Peter Haubrick, Gouse Saheb, Florian Graef, David Berman

PDF

Open Access

TL;DR

This paper introduces a consortium consistency approach that combines responses from multiple diverse LLMs to improve hallucination detection and mitigation, achieving better performance and lower inference costs than single-model methods.

Contribution

It extends single-model consistency techniques to multi-model ensembles, demonstrating significant improvements in hallucination mitigation across various LLMs and conditions.

Findings

01

Enhanced hallucination detection accuracy with multi-LLM ensembles

02

Reduced inference costs compared to single-model approaches

03

Effective across diverse model architectures and training data

Abstract

Recent work has demonstrated state-of-the-art results in large language model (LLM) hallucination detection and mitigation through consistency-based approaches which involve aggregating multiple responses sampled from a single LLM for a given prompt. These approaches help offset limitations stemming from the imperfect data on which LLMs are trained, which includes biases and under-representation of information required at deployment time among other limitations which can lead to hallucinations. We show that extending these single-model consistency methods to combine responses from multiple LLMs with different training data, training schemes and model architectures can result in substantial further improvements in hallucination detection and mitigation capabilities beyond their single-model consistency counterparts. We evaluate this "consortium consistency" approach across many model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Advanced Graph Neural Networks