Improving Uncertainty Quantification in Large Language Models via   Semantic Embeddings

Yashvir S. Grewal; Edwin V. Bonilla; Thang D. Bui

arXiv:2410.22685·cs.LG·October 31, 2024

Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings

Yashvir S. Grewal, Edwin V. Bonilla, Thang D. Bui

PDF

Open Access

TL;DR

This paper introduces a semantic embedding-based method for more accurate and robust uncertainty quantification in large language models, reducing sensitivity to wording differences and computational costs.

Contribution

It presents a novel semantic embedding approach and an amortised model for efficient uncertainty estimation in LLMs, outperforming traditional likelihood-based methods.

Findings

01

More accurate uncertainty estimates across multiple datasets

02

Reduced overestimation of uncertainty due to irrelevant words

03

Lower computational overhead with the amortised approach

Abstract

Accurately quantifying uncertainty in large language models (LLMs) is crucial for their reliable deployment, especially in high-stakes applications. Current state-of-the-art methods for measuring semantic uncertainty in LLMs rely on strict bidirectional entailment criteria between multiple generated responses and also depend on sequence likelihoods. While effective, these approaches often overestimate uncertainty due to their sensitivity to minor wording differences, additional correct information, and non-important words in the sequence. We propose a novel approach that leverages semantic embeddings to achieve smoother and more robust estimation of semantic uncertainty in LLMs. By capturing semantic similarities without depending on sequence likelihoods, our method inherently reduces any biases introduced by irrelevant words in the answers. Furthermore, we introduce an amortised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI)