Incentives or Ontology? A Structural Rebuttal to OpenAI's Hallucination Thesis
Richard Ackermann, Simeon Emanuilov

TL;DR
This paper argues that hallucinations in large language models are due to their architecture's inherent limitations in representing reality, not just misaligned incentives, and suggests hybrid systems for reliable AI.
Contribution
It provides a structural analysis of hallucinations, showing they are unavoidable in transformers without external truth-validation modules.
Findings
Hallucination is an architectural inevitability of transformers.
External truth-validation modules can eliminate hallucinations.
Incentive-based solutions are insufficient to address hallucinations.
Abstract
OpenAI has recently argued that hallucinations in large language models result primarily from misaligned evaluation incentives that reward confident guessing rather than epistemic humility. On this view, hallucination is a contingent behavioral artifact, remediable through improved benchmarks and reward structures. In this paper, we challenge that interpretation. Drawing on previous work on structural hallucination and empirical experiments using a Licensing Oracle, we argue that hallucination is not an optimization failure but an architectural inevitability of the transformer model. Transformers do not represent the world; they model statistical associations among tokens. Their embedding spaces form a pseudo-ontology derived from linguistic co-occurrence rather than world-referential structure. At ontological boundary conditions - regions where training data is sparse or incoherent -…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks
