Hallucination, Monofacts, and Miscalibration: An Empirical Investigation
Miranda Muqing Miao, Michael Kearns

TL;DR
This paper empirically investigates the relationship between hallucinations, monofacts, and miscalibration in language models, introducing techniques to control and reduce hallucinations while analyzing their trade-offs.
Contribution
It provides the first empirical analysis of the three-way relationship in classical and transformer models, and introduces selective upweighting to reduce hallucinations.
Findings
Monofact rate positively correlates with hallucination.
Selective upweighting reduces hallucination by up to 40%.
Trade-off observed between accuracy and hallucination reduction.
Abstract
Hallucinated facts in large language models (LLMs) have recently been shown to obey a statistical lower bound determined by the monofact rate (related to the classical Good-Turing missing mass estimator) minus model miscalibration (Kalai & Vempala, 2024). We present the first empirical investigation of this three-way relationship in classical n-gram models and fine-tuned encoder-decoder Transformers. By generating training data from Pareto distributions with varying shape parameters, we systematically control the monofact rates and establish its positive relationship with hallucination. To bridge theory and practice, we derive an empirical analog of the hallucination bound by replacing the population miscalibration term (Section 2.1) with an empirical bin-wise KL divergence and confirm its practical viability. We then introduce selective upweighting -- a simple yet effective technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHallucinations in medical conditions · Paranormal Experiences and Beliefs · Biofield Effects and Biophysics
