Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs

Nils Rautenberg; Sven Schippkus

arXiv:2601.00641·cs.CL·January 5, 2026

Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs

Nils Rautenberg, Sven Schippkus

PDF

Open Access

TL;DR

This paper introduces a probabilistic framework that guarantees a significant reduction in hallucinations in large language models by using repeated prompts and ensemble judging, applicable in deterministic workflows without altering models.

Contribution

It formalizes a method to exponentially decrease hallucination probabilities in fixed-input LLM tasks using repeated prompts and ensemble judging, with theoretical guarantees.

Findings

01

Hallucination probability decreases exponentially with repeated prompts.

02

Ensemble judging further reduces hallucinations exponentially.

03

Experimental results match theoretical predictions exactly.

Abstract

Large language models (LLMs) frequently produce contextual hallucinations, where generated content contradicts or ignores information explicitly stated in the prompt. Such errors are particularly problematic in deterministic automation workflows, where inputs are fixed and correctness is unambiguous. We introduce a simple and model-agnostic framework that provides explicit probabilistic guarantees for reducing hallucinations in this setting. We formalize the notion of a specific task, defined by a fixed input and a deterministic correctness criterion, and show that issuing the same prompt in independent context windows yields an exponential reduction in the probability that all model outputs are incorrect. To identify a correct answer among repeated runs, we incorporate an LLM-as-a-judge and prove that the probability that the judged pipeline fails decays at a rate determined by the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)