Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Yue Zhang; Yafu Li; Leyang Cui; Deng Cai; Lemao Liu; Tingchen Fu; Xinting Huang; Enbo Zhao; Yu Zhang; Chen Xu; Yulong Chen; Longyue Wang; Anh Tuan Luu; Wei Bi; Freda Shi; Shuming Shi

arXiv:2309.01219·cs.CL·September 16, 2025·238 cites

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Chen Xu, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi

PDF

Open Access 1 Repo

TL;DR

This survey reviews recent research on hallucinations in large language models, focusing on detection, explanation, mitigation, and future challenges to improve their reliability in real-world applications.

Contribution

It provides a comprehensive taxonomy of hallucination phenomena, evaluates existing mitigation approaches, and discusses future research directions in LLM hallucination.

Findings

01

Taxonomies of hallucination types in LLMs

02

Evaluation benchmarks for hallucination detection

03

Analysis of current mitigation strategies

Abstract

While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge to the reliability of LLMs in real-world scenarios. In this paper, we survey recent efforts on the detection, explanation, and mitigation of hallucination, with an emphasis on the unique challenges posed by LLMs. We present taxonomies of the LLM hallucination phenomena and evaluation benchmarks, analyze existing approaches aiming at mitigating LLM hallucination, and discuss potential directions for future research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hillzhang1999/llm-hallucination-survey
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning in Healthcare · COVID-19 diagnosis using AI