Textual Bayes: Quantifying Prompt Uncertainty in LLM-Based Systems

Brendan Leigh Ross; No\"el Vouitsis; Atiyeh Ashari Ghomi; Rasa Hosseinzadeh; Ji Xin; Zhaoyan Liu; Yi Sui; Shiyi Hou; Kin Kwan Leung; Gabriel Loaiza-Ganem; Jesse C. Cresswell

arXiv:2506.10060·cs.LG·April 21, 2026

Textual Bayes: Quantifying Prompt Uncertainty in LLM-Based Systems

Brendan Leigh Ross, No\"el Vouitsis, Atiyeh Ashari Ghomi, Rasa Hosseinzadeh, Ji Xin, Zhaoyan Liu, Yi Sui, Shiyi Hou, Kin Kwan Leung, Gabriel Loaiza-Ganem, Jesse C. Cresswell

PDF

1 Video

TL;DR

This paper introduces a Bayesian approach to quantify uncertainty in LLM-based systems by interpreting prompts as textual parameters and employing a novel MCMC algorithm, MHLP, to improve predictive accuracy and calibration.

Contribution

It presents a new Bayesian framework for prompt uncertainty quantification and introduces MHLP, a practical MCMC method compatible with existing LLM pipelines, including closed-source models.

Findings

01

Improved predictive accuracy on LLM benchmarks.

02

Enhanced uncertainty quantification and calibration.

03

Method applicable to closed-source LLMs.

Abstract

Although large language models (LLMs) are becoming increasingly capable of solving challenging real-world tasks, accurately quantifying their uncertainty remains a critical open problem--one that limits their applicability in high-stakes domains. This challenge is further compounded by the closed-source, black-box nature of many state-of-the-art LLMs. Moreover, LLM-based systems can be highly sensitive to the prompts that bind them together, which often require significant manual tuning (i.e., prompt engineering). In this work, we address these challenges by viewing LLM-based systems through a Bayesian lens. We interpret prompts as textual parameters in a statistical model, allowing us to use a small training dataset to perform Bayesian inference over these prompts. This novel perspective enables principled uncertainty quantification over both the model's textual parameters and its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Textual Bayes: Quantifying Prompt Uncertainty in LLM-Based Systems· slideslive