Towards Unification of Hallucination Detection and Fact Verification for Large Language Models

Weihang Su; Jianming Long; Changyue Wang; Shiyu Lin; Jingyan Xu; Ziyi Ye; Qingyao Ai; Yiqun Liu

arXiv:2512.02772·cs.CL·December 3, 2025

Towards Unification of Hallucination Detection and Fact Verification for Large Language Models

Weihang Su, Jianming Long, Changyue Wang, Shiyu Lin, Jingyan Xu, Ziyi Ye, Qingyao Ai, Yiqun Liu

PDF

Open Access

TL;DR

This paper introduces UniFact, a unified framework for evaluating and comparing hallucination detection and fact verification in large language models, revealing their complementarity and proposing integrated approaches for improved factuality assessment.

Contribution

The paper presents UniFact, the first unified evaluation framework for hallucination detection and fact verification, facilitating direct comparison and integration of these paradigms in LLMs.

Findings

01

No single paradigm is universally best.

02

HD and FV capture different aspects of factual errors.

03

Hybrid methods outperform individual approaches.

Abstract

Large Language Models (LLMs) frequently exhibit hallucinations, generating content that appears fluent and coherent but is factually incorrect. Such errors undermine trust and hinder their adoption in real-world applications. To address this challenge, two distinct research paradigms have emerged: model-centric Hallucination Detection (HD) and text-centric Fact Verification (FV). Despite sharing the same goal, these paradigms have evolved in isolation, using distinct assumptions, datasets, and evaluation protocols. This separation has created a research schism that hinders their collective progress. In this work, we take a decisive step toward bridging this divide. We introduce UniFact, a unified evaluation framework that enables direct, instance-level comparison between FV and HD by dynamically generating model outputs and corresponding factuality labels. Through large-scale…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Mental Health via Writing · Misinformation and Its Impacts