Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs
Nils Rautenberg, Sven Schippkus

TL;DR
This paper introduces a probabilistic framework that guarantees a significant reduction in hallucinations in large language models by using repeated prompts and ensemble judging, applicable in deterministic workflows without altering models.
Contribution
It formalizes a method to exponentially decrease hallucination probabilities in fixed-input LLM tasks using repeated prompts and ensemble judging, with theoretical guarantees.
Findings
Hallucination probability decreases exponentially with repeated prompts.
Ensemble judging further reduces hallucinations exponentially.
Experimental results match theoretical predictions exactly.
Abstract
Large language models (LLMs) frequently produce contextual hallucinations, where generated content contradicts or ignores information explicitly stated in the prompt. Such errors are particularly problematic in deterministic automation workflows, where inputs are fixed and correctness is unambiguous. We introduce a simple and model-agnostic framework that provides explicit probabilistic guarantees for reducing hallucinations in this setting. We formalize the notion of a specific task, defined by a fixed input and a deterministic correctness criterion, and show that issuing the same prompt in independent context windows yields an exponential reduction in the probability that all model outputs are incorrect. To identify a correct answer among repeated runs, we incorporate an LLM-as-a-judge and prove that the probability that the judged pipeline fails decays at a rate determined by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)
