Trust in One Round: Confidence Estimation for Large Language Models via Structural Signals

Pengyue Yang; Jiawen Wen; Haolin Jin; Linghan Huang; Huaming Chen; Ling Chen

arXiv:2602.00977·cs.CL·February 3, 2026

TL;DR

This paper introduces Structural Confidence, a single-pass, model-agnostic method that predicts whether an LLM output is correct by analyzing structural signals in the model's hidden states. It outperforms traditional confidence estimators across diverse tasks.

Contribution

The work presents a new framework leveraging multi-scale structural signals from LLMs' hidden states, enabling efficient, robust confidence estimation without multiple stochastic samples or auxiliary models.

Findings

01. Outperforms baselines in AUROC and AUPR across four benchmarks

02. Uses a single deterministic pass for confidence estimation

03. Effective across diverse, domain-specific tasks
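
For readers unfamiliar with the first metric, AUROC is the probability that a randomly chosen correct output receives a higher confidence score than a randomly chosen incorrect one, with ties counted as half. The sketch below computes it by direct pairwise comparison. The function name `auroc` and the toy scores are illustrative assumptions, not taken from the paper or its evaluation code.

```python
import numpy as np

def auroc(scores, labels):
    """AUROC by pairwise comparison (illustrative helper, not from the paper).

    scores: confidence score per output, higher means "more likely correct".
    labels: 1 if the output was actually correct, 0 otherwise.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    pos = scores[labels]    # scores of correct outputs
    neg = scores[~labels]   # scores of incorrect outputs
    if pos.size == 0 or neg.size == 0:
        raise ValueError("need at least one correct and one incorrect output")
    # Compare every (correct, incorrect) pair; O(n_pos * n_neg), fine for a demo.
    diff = pos[:, None] - neg[None, :]
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return float(wins / diff.size)

# Toy example with made-up scores: one correct output (0.2) is ranked
# below one incorrect output (0.8), so 3 of 4 pairs are ordered correctly.
print(auroc([0.9, 0.2, 0.8, 0.1], [1, 1, 0, 0]))  # 0.75
```

A value of 0.5 corresponds to a confidence score that ranks outputs no better than chance, and 1.0 to a perfect ranking.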

Abstract

Large language models (LLMs) are increasingly deployed in domains where errors carry high social, scientific, or safety costs. Yet standard confidence estimators, such as token likelihood, semantic similarity, and multi-sample consistency, remain brittle under distribution shift, domain-specialised text, and compute limits. In this work, we present Structural Confidence, a single-pass, model-agnostic framework that enhances output correctness prediction based on multi-scale structural signals derived from a model's final-layer hidden-state trajectory. By combining spectral, local-variation, and global shape descriptors, our method captures internal stability patterns that are missed by probabilities and sentence embeddings. We conduct extensive, cross-domain evaluation across four heterogeneous benchmarks: FEVER (fact verification), SciFact (scientific claims), WikiBio-hallucination…
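
To make the three descriptor families concrete, here is a minimal sketch of what a spectral, a local-variation, and a global shape feature of a hidden-state trajectory could look like. The abstract does not specify the paper's actual descriptors, so every feature below (`high_freq_frac`, `mean_step`, `straightness`) and the function name `structural_features` are assumptions chosen for illustration, not the authors' method. The input is assumed to be a `(T, d)` array holding one final-layer hidden state per generated token.

```python
import numpy as np

def structural_features(hidden):
    """Toy structural descriptors of a (T, d) hidden-state trajectory.

    All three features are illustrative stand-ins, not the paper's definitions.
    """
    hidden = np.asarray(hidden, dtype=float)
    if hidden.ndim != 2 or hidden.shape[0] < 3:
        raise ValueError("expected a (T, d) array with T >= 3")

    # Reduce the trajectory to a 1-D signal: the L2 norm of each token state.
    norms = np.linalg.norm(hidden, axis=1)

    # Spectral (assumed): share of signal power in the upper half of frequencies.
    power = np.abs(np.fft.rfft(norms - norms.mean())) ** 2
    total = power.sum()
    cutoff = len(power) // 2
    high_freq_frac = float(power[cutoff:].sum() / total) if total > 0 else 0.0

    # Local variation (assumed): mean distance between consecutive states.
    steps = np.linalg.norm(np.diff(hidden, axis=0), axis=1)
    mean_step = float(steps.mean())

    # Global shape (assumed): end-to-end displacement over total path length.
    # 1.0 means a straight path; values near 0 mean the path doubles back.
    path_len = steps.sum()
    displacement = np.linalg.norm(hidden[-1] - hidden[0])
    straightness = float(displacement / path_len) if path_len > 0 else 0.0

    return {
        "high_freq_frac": high_freq_frac,
        "mean_step": mean_step,
        "straightness": straightness,
    }

# Two synthetic trajectories: a straight line and a back-and-forth zigzag.
line = np.outer(np.arange(5), [1.0, 0.0])
zigzag = np.array([[0, 0], [1, 0], [0, 0], [1, 0], [0, 0]], dtype=float)
print(structural_features(line))
print(structural_features(zigzag))
```

Both synthetic trajectories have the same mean step size, yet their straightness differs sharply, which is the kind of information a per-token probability would not expose. In a full pipeline, features like these would presumably be mapped to a correctness score by a learned predictor; how the paper does this is not stated in the text shown here.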

Taxonomy

Topics: Topic Modeling · Artificial Intelligence in Healthcare and Education · Computational and Text Analysis Methods