Large Language Models Hallucination: A Comprehensive Survey
Aisha Alansari, Hamzah Luqman

TL;DR
This comprehensive survey reviews the causes, detection, and mitigation of hallucinations in large language models, highlighting their impact on reliability and outlining future research directions.
Contribution
It provides a detailed taxonomy of hallucination types, analyzes root causes across the LLM development lifecycle, and evaluates current detection and mitigation strategies.
Findings
Hallucination types are categorized and linked to specific causes.
Current detection and mitigation methods have notable limitations.
Benchmark datasets and metrics for hallucination evaluation are reviewed.
Abstract
Large language models (LLMs) have transformed natural language processing, achieving remarkable performance across diverse tasks. However, their impressive fluency often comes at the cost of producing false or fabricated information, a phenomenon known as hallucination. Hallucination refers to the generation of content by an LLM that is fluent and syntactically correct but factually inaccurate or unsupported by external evidence. Hallucinations undermine the reliability and trustworthiness of LLMs, especially in domains requiring factual accuracy. This survey provides a comprehensive review of research on hallucination in LLMs, with a focus on causes, detection, and mitigation. We first present a taxonomy of hallucination types and analyze their root causes across the entire LLM development lifecycle, from data collection and architecture design to inference. We further examine how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Misinformation and Its Impacts · Artificial Intelligence in Healthcare and Education
