TL;DR
This paper introduces a Bayesian approach to quantify uncertainty in LLM-based systems by interpreting prompts as textual parameters and employing a novel MCMC algorithm, MHLP, to improve predictive accuracy and calibration.
Contribution
It presents a new Bayesian framework for prompt uncertainty quantification and introduces MHLP, a practical MCMC method compatible with existing LLM pipelines, including closed-source models.
Findings
Improved predictive accuracy on LLM benchmarks.
Enhanced uncertainty quantification and calibration.
Method applicable to closed-source LLMs.
Abstract
Although large language models (LLMs) are becoming increasingly capable of solving challenging real-world tasks, accurately quantifying their uncertainty remains a critical open problem--one that limits their applicability in high-stakes domains. This challenge is further compounded by the closed-source, black-box nature of many state-of-the-art LLMs. Moreover, LLM-based systems can be highly sensitive to the prompts that bind them together, which often require significant manual tuning (i.e., prompt engineering). In this work, we address these challenges by viewing LLM-based systems through a Bayesian lens. We interpret prompts as textual parameters in a statistical model, allowing us to use a small training dataset to perform Bayesian inference over these prompts. This novel perspective enables principled uncertainty quantification over both the model's textual parameters and its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
