A comprehensive taxonomy of hallucinations in Large Language Models
Manuel Cossio

TL;DR
This paper presents a detailed taxonomy of hallucinations in large language models, analyzing their types, causes, and implications, and discusses evaluation and mitigation strategies for responsible deployment.
Contribution
It offers the first comprehensive taxonomy of LLM hallucinations, including definitions, causes, manifestations, and mitigation approaches, highlighting their theoretical inevitability.
Findings
Hallucinations are categorized into intrinsic and extrinsic types.
Evaluation benchmarks and metrics for hallucination detection are surveyed.
Mitigation strategies include architectural and systemic approaches.
Abstract
Large language models (LLMs) have revolutionized natural language processing, yet their propensity for hallucination, generating plausible but factually incorrect or fabricated content, remains a critical challenge. This report provides a comprehensive taxonomy of LLM hallucinations, beginning with a formal definition and a theoretical framework that posits its inherent inevitability in computable LLMs, irrespective of architecture or training. It explores core distinctions, differentiating between intrinsic (contradicting input context) and extrinsic (inconsistent with training data or reality), as well as factuality (absolute correctness) and faithfulness (adherence to input). The report then details specific manifestations, including factual errors, contextual and logical inconsistencies, temporal disorientation, ethical violations, and task-specific hallucinations across domains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Mental Health via Writing
