Kernel Language Entropy: Fine-grained Uncertainty Quantification for   LLMs from Semantic Similarities

Alexander Nikitin; Jannik Kossen; Yarin Gal; Pekka Marttinen

arXiv:2405.20003·cs.LG·May 31, 2024·2 cites

Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities

Alexander Nikitin, Jannik Kossen, Yarin Gal, Pekka Marttinen

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces Kernel Language Entropy (KLE), a novel semantic uncertainty estimation method for LLMs that improves trustworthiness by capturing fine-grained semantic dependencies and outperforming previous approaches.

Contribution

KLE is a new entropy-based approach that encodes semantic similarities with kernels and provides more detailed uncertainty estimates for LLM outputs.

Findings

01

KLE outperforms previous methods in uncertainty quantification across multiple datasets.

02

Theoretically, KLE generalizes the semantic entropy method.

03

Empirically, KLE improves detection of incorrect model responses.

Abstract

Uncertainty quantification in Large Language Models (LLMs) is crucial for applications where safety and reliability are important. In particular, uncertainty can be used to improve the trustworthiness of LLMs by detecting factually incorrect model responses, commonly called hallucinations. Critically, one should seek to capture the model's semantic uncertainty, i.e., the uncertainty over the meanings of LLM outputs, rather than uncertainty over lexical or syntactic variations that do not affect answer correctness. To address this problem, we propose Kernel Language Entropy (KLE), a novel method for uncertainty estimation in white- and black-box LLMs. KLE defines positive semidefinite unit trace kernels to encode the semantic similarities of LLM outputs and quantifies uncertainty using the von Neumann entropy. It considers pairwise semantic dependencies between answers (or semantic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling