UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models

Roman Vashurin; Maiya Goloburda; Preslav Nakov; Maxim Panov

arXiv:2505.19060·cs.CL·September 26, 2025

UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models

Roman Vashurin, Maiya Goloburda, Preslav Nakov, Maxim Panov

PDF

Open Access 1 Video

TL;DR

The paper introduces UNCERTAINTY-LINE, a simple post-hoc debiasing method that produces length-invariant uncertainty estimates for large language models, improving reliability across various tasks.

Contribution

It proposes a novel length-invariant uncertainty estimation technique that corrects biases in existing methods, applicable across different models and tasks.

Findings

01

UNCERTAINTY-LINE improves uncertainty estimates in translation, summarization, and QA.

02

The method is model-agnostic and post-hoc, requiring no retraining.

03

It outperforms existing length-normalized uncertainty measures.

Abstract

Large Language Models (LLMs) have become indispensable tools across various applications, making it more important than ever to ensure the quality and the trustworthiness of their outputs. This has led to growing interest in uncertainty quantification (UQ) methods for assessing the reliability of LLM outputs. Many existing UQ techniques rely on token probabilities, which inadvertently introduces a bias with respect to the length of the output. While some methods attempt to account for this, we demonstrate that such biases persist even in length-normalized approaches. To address the problem, here we propose UNCERTAINTY-LINE: (Length-INvariant Estimation), a simple debiasing procedure that regresses uncertainty scores on output length and uses the residuals as corrected, length-invariant estimates. Our method is post-hoc, model-agnostic, and applicable to a range of UQ measures. Through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

UNCERTAINTY-LINE: Length-Invariant Estimation of Uncertainty for Large Language Models· underline

Taxonomy

TopicsTopic Modeling