Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation
Jinyi Han, Tingyun Li, Shisong Chen, Jie Shi, Xinyi Wang, Guanglei Yue, Jiaqing Liang, Xin Lin, Liqian Wen, Zulong Chen, Yanghua Xiao

TL;DR
This paper introduces FineCE, a novel method for fine-grained, accurate confidence estimation during large language model generation, improving trustworthiness by predicting confidence scores throughout the text creation process.
Contribution
The paper presents a new confidence estimation approach, FineCE, with a training pipeline, a Backward Confidence Integration strategy, and position selection methods, advancing beyond coarse-grained existing techniques.
Findings
FineCE outperforms classical methods on benchmark datasets.
The approach provides continuous confidence scores during generation.
Experiments validate the effectiveness of the proposed strategies.
Abstract
While large language models (LLMs) have demonstrated remarkable performance across diverse tasks, they fundamentally lack self-awareness and frequently exhibit overconfidence, assigning high confidence scores to incorrect predictions. Accurate confidence estimation is therefore critical for enhancing the trustworthiness and reliability of LLM-generated outputs. However, existing approaches suffer from coarse-grained scoring mechanisms that fail to provide fine-grained, continuous confidence estimates throughout the generation process. To address these limitations, we introduce FineCE, a novel confidence estimation method that delivers accurate, fine-grained confidence scores during text generation. Specifically, we first develop a comprehensive pipeline for constructing training data that effectively captures the underlying probabilistic distribution of LLM responses, and then train a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Magnetic confinement fusion research · Reservoir Engineering and Simulation Methods
