Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency

Aman Goel; Daniel Schwartz; Yanjun Qi

arXiv:2508.14314·cs.CL·November 4, 2025

Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency

Aman Goel, Daniel Schwartz, Yanjun Qi

PDF

Open Access 1 Video

TL;DR

Finch-Zk is a black-box framework that detects and mitigates hallucinations in large language models by leveraging fine-grained cross-model consistency checks, significantly improving factual accuracy without external knowledge sources.

Contribution

The paper introduces Finch-Zk, a novel approach for hallucination detection and mitigation in LLMs using cross-model consistency, without external knowledge, enhancing factual reliability.

Findings

01

Improves hallucination detection F1 scores by 6-39%.

02

Achieves up to 9% improvement in answer accuracy.

03

Demonstrates effectiveness on multiple datasets and models.

Abstract

Large language models (LLMs) have demonstrated impressive capabilities across diverse tasks, but they remain susceptible to hallucinations--generating content that appears plausible but contains factual inaccuracies. We present Finch-Zk, a black-box framework that leverages fine-grained cross-model consistency to detect and mitigate hallucinations in LLM outputs without requiring external knowledge sources. Finch-Zk introduces two key innovations: 1) a cross-model consistency checking strategy that reveals fine-grained inaccuracies by comparing responses generated by diverse models from semantically-equivalent prompts, and 2) a targeted mitigation technique that applies precise corrections to problematic segments while preserving accurate content. Experiments on the FELM dataset show Finch-Zk improves hallucination detection F1 scores by 6-39\% compared to existing approaches. For…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency· underline

Taxonomy

TopicsTopological and Geometric Data Analysis