Survey of Hallucination in Natural Language Generation

Ziwei Ji; Nayeon Lee; Rita Frieske; Tiezheng Yu; Dan Su; Yan Xu,; Etsuko Ishii; Yejin Bang; Delong Chen; Wenliang Dai; Ho Shu Chan; Andrea; Madotto; Pascale Fung

arXiv:2202.03629·cs.CL·July 16, 2024·96 cites

Survey of Hallucination in Natural Language Generation

Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu,, Etsuko Ishii, Yejin Bang, Delong Chen, Wenliang Dai, Ho Shu Chan, Andrea, Madotto, Pascale Fung

PDF

Open Access

TL;DR

This survey reviews recent research on hallucinations in natural language generation, covering metrics, mitigation strategies, and task-specific challenges, including large language models, to guide future efforts in reducing unintended outputs.

Contribution

It provides the first comprehensive review of hallucination measurement and mitigation in NLG, covering various tasks and LLMs, and outlines future research directions.

Findings

01

Overview of metrics and mitigation methods for hallucinations.

02

Analysis of hallucination issues across multiple NLG tasks.

03

Discussion of hallucinations in large language models.

Abstract

Natural Language Generation (NLG) has improved exponentially in recent years thanks to the development of sequence-to-sequence deep learning technologies such as Transformer-based language models. This advancement has led to more fluent and coherent NLG, leading to improved development in downstream tasks such as abstractive summarization, dialogue generation and data-to-text generation. However, it is also apparent that deep learning based generation is prone to hallucinate unintended text, which degrades the system performance and fails to meet user expectations in many real-world scenarios. To address this issue, many studies have been presented in measuring and mitigating hallucinated texts, but these have never been reviewed in a comprehensive manner before. In this survey, we thus provide a broad overview of the research progress and challenges in the hallucination problem in NLG.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications