Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs

Xiaomin Li; Zhou Yu; Ziji Zhang; Yingying Zhuang; Swair Shah; Narayanan Sadagopan; Anurag Beniwal

arXiv:2502.21239·cs.CL·November 13, 2025

Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs

Xiaomin Li, Zhou Yu, Ziji Zhang, Yingying Zhuang, Swair Shah, Narayanan Sadagopan, Anurag Beniwal

PDF

Open Access 1 Video

TL;DR

Semantic Volume introduces a new mathematical measure to quantify and detect both external and internal uncertainties in large language models, improving their reliability without internal access.

Contribution

The paper proposes Semantic Volume, a novel, generalizable, and unsupervised method for quantifying uncertainty in LLMs by perturbing queries and responses and analyzing their semantic dispersion.

Findings

01

Outperforms existing uncertainty detection baselines

02

Effectively detects both external and internal uncertainties

03

Provides theoretical links to differential entropy

Abstract

Large language models (LLMs) have demonstrated remarkable performance across diverse tasks by encoding vast amounts of factual knowledge. However, they are still prone to hallucinations, generating incorrect or misleading information, often accompanied by high uncertainty. Existing methods for hallucination detection primarily focus on quantifying internal uncertainty, which arises from missing or conflicting knowledge within the model. However, hallucinations can also stem from external uncertainty, where ambiguous user queries lead to multiple possible interpretations. In this work, we introduce Semantic Volume, a novel mathematical measure for quantifying both external and internal uncertainty in LLMs. Our approach perturbs queries and responses, embeds them in a semantic space, and computes the Gram matrix determinant of the embedding vectors, capturing their dispersion as a measure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Semantic Volume: Quantifying and Detecting Both External and Internal Uncertainty in LLMs· underline

Taxonomy

TopicsSemantic Web and Ontologies · Natural Language Processing Techniques

MethodsFocus