HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy

Fan Xu; Xinyu Hu; Zhenghan Yu; Li Lin; Xu Zhang; Yang Zhang; Wei Zhou; Jinjie Gu; Xiaojun Wan

arXiv:2510.19318·cs.CL·October 23, 2025

HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy

Fan Xu, Xinyu Hu, Zhenghan Yu, Li Lin, Xu Zhang, Yang Zhang, Wei Zhou, Jinjie Gu, Xiaojun Wan

PDF

Open Access

TL;DR

This paper introduces a comprehensive hallucination taxonomy and the HAD models, which detect, identify, and correct hallucinations in language models across multiple tasks, demonstrating state-of-the-art performance and robustness.

Contribution

It proposes a detailed hallucination taxonomy and a versatile HAD model that integrates detection, span identification, and correction in a single inference process.

Findings

01

HAD models outperform existing baselines on multiple benchmarks.

02

The models achieve state-of-the-art results on HaluEval, FactCHD, and FaithBench.

03

The approach is effective across various NLG tasks and out-of-domain data.

Abstract

The increasing reliance on natural language generation (NLG) models, particularly large language models, has raised concerns about the reliability and accuracy of their outputs. A key challenge is hallucination, where models produce plausible but incorrect information. As a result, hallucination detection has become a critical task. In this work, we introduce a comprehensive hallucination taxonomy with 11 categories across various NLG tasks and propose the HAllucination Detection (HAD) models https://github.com/pku0xff/HAD, which integrate hallucination detection, span-level identification, and correction into a single inference process. Trained on an elaborate synthetic dataset of about 90K samples, our HAD models are versatile and can be applied to various NLG tasks. We also carefully annotate a test set for hallucination detection, called HADTest, which contains 2,248 samples.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Mental Health via Writing · Misinformation and Its Impacts