Recurrent Confidence Chain: Temporal-Aware Uncertainty Quantification in Large Language Models
Zhenjiang Mao, Anirudhh Venkat

TL;DR
This paper introduces a novel temporal-aware uncertainty quantification method for large language models, improving confidence estimation in reasoning tasks by analyzing semantic correlations and historical confidence, leading to better calibration.
Contribution
It proposes inter-step attention and hidden confidence mechanisms to enhance uncertainty estimation in large language models' reasoning processes.
Findings
Outperforms state-of-the-art methods on GAOKAO and CLadder datasets.
Achieves better calibration with lower Expected Calibration Error.
Demonstrates improved predictive quality on reasoning benchmarks.
Abstract
As reasoning modules, such as the chain-of-thought mechanism, are applied to large language models, they achieve strong performance on various tasks such as answering common-sense questions and solving math problems. The main challenge now is to assess the uncertainty of answers, which can help prevent misleading or serious hallucinations for users. Although current methods analyze long reasoning sequences by filtering unrelated tokens and examining potential connections between nearby tokens or sentences, the temporal spread of confidence is often overlooked. This oversight can lead to inflated overall confidence, even when earlier steps exhibit very low confidence. To address this issue, we propose a novel method that incorporates inter-step attention to analyze semantic correlations across steps. For handling long-horizon responses, we introduce a hidden confidence mechanism to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
