A Formal Framework for Uncertainty Analysis of Text Generation with Large Language Models
Steffen Herbold, Florian Lemmerich

TL;DR
This paper introduces a formal framework for measuring and analyzing the inherent uncertainty in text generation by large language models, considering prompts, generation, and interpretation as interconnected processes.
Contribution
It provides a unified formal framework that models all sources of uncertainty in LLM text generation and relates existing methods within this framework.
Findings
Framework models prompting, generation, and interpretation as interconnected processes.
Demonstrates how existing uncertainty methods fit within the framework.
Identifies additional aspects of uncertainty not previously studied.
Abstract
The generation of texts using Large Language Models (LLMs) is inherently uncertain, with sources of uncertainty being not only the generation of texts, but also the prompt used and the downstream interpretation. Within this work, we provide a formal framework for the measurement of uncertainty that takes these different aspects into account. Our framework models prompting, generation, and interpretation as interconnected autoregressive processes that can be combined into a single sampling tree. We introduce filters and objective functions to describe how different aspects of uncertainty can be expressed over the sampling tree and demonstrate how to express existing approaches towards uncertainty through these functions. With our framework we show not only how different methods are formally related and can be reduced to a common core, but also point out additional aspects of uncertainty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
