Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity

Dang Nguyen; Ali Payani; Baharan Mirzasoleiman

arXiv:2506.00245·cs.LG·June 3, 2025

Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity

Dang Nguyen, Ali Payani, Baharan Mirzasoleiman

PDF

Open Access

TL;DR

This paper introduces a new uncertainty quantification method for large language models that improves upon semantic entropy by considering pairwise semantic similarities, leading to better detection of hallucinations in longer responses.

Contribution

We propose a simple, effective black-box method extending semantic entropy with pairwise semantic similarity, enhancing uncertainty estimation for LLMs in various tasks.

Findings

01

Outperforms semantic entropy in uncertainty estimation

02

Effective across multiple LLMs and tasks

03

Theoretically generalizes semantic entropy

Abstract

Hallucination in large language models (LLMs) can be detected by assessing the uncertainty of model outputs, typically measured using entropy. Semantic entropy (SE) enhances traditional entropy estimation by quantifying uncertainty at the semantic cluster level. However, as modern LLMs generate longer one-sentence responses, SE becomes less effective because it overlooks two crucial factors: intra-cluster similarity (the spread within a cluster) and inter-cluster similarity (the distance between clusters). To address these limitations, we propose a simple black-box uncertainty quantification method inspired by nearest neighbor estimates of entropy. Our approach can also be easily extended to white-box settings by incorporating token probabilities. Additionally, we provide theoretical results showing that our method generalizes semantic entropy. Extensive empirical results demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques