Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception

Shiyu Ni; Keping Bi; Jiafeng Guo; Lulu Yu; Baolong Bi; Xueqi Cheng

arXiv:2502.11677·cs.CL·June 26, 2025

Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception

Shiyu Ni, Keping Bi, Jiafeng Guo, Lulu Yu, Baolong Bi, Xueqi Cheng

PDF

Open Access 1 Video

TL;DR

This paper investigates how large language models can better perceive their knowledge boundaries by leveraging internal states, improving confidence estimation, efficiency, and risk control through novel calibration methods.

Contribution

It introduces a confidence calibration method ($C^3$) that enhances LLMs' ability to recognize knowledge gaps and improves their reliability in critical tasks.

Findings

01

LLMs show significant pre-response confidence perception.

02

Post-generation perception further refines confidence estimates.

03

The $C^3$ method increases unknown perception rate by over 5%.

Abstract

Large language models (LLMs) exhibit impressive performance across diverse tasks but often struggle to accurately gauge their knowledge boundaries, leading to confident yet incorrect responses. This paper explores leveraging LLMs' internal states to enhance their perception of knowledge boundaries from efficiency and risk perspectives. We investigate whether LLMs can estimate their confidence using internal states before response generation, potentially saving computational resources. Our experiments on datasets like Natural Questions, HotpotQA, and MMLU reveal that LLMs demonstrate significant pre-generation perception, which is further refined post-generation, with perception gaps remaining stable across varying conditions. To mitigate risks in critical domains, we introduce Confidence Consistency-based Calibration ( $C^{3}$ ), which assesses confidence consistency through question…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception· underline

Taxonomy

TopicsAdvanced Data Storage Technologies