LLM Uncertainty Quantification through Directional Entailment Graph and   Claim Level Response Augmentation

Longchao Da; Tiejin Chen; Lu Cheng; Hua Wei

arXiv:2407.00994·cs.CL·July 9, 2024

LLM Uncertainty Quantification through Directional Entailment Graph and Claim Level Response Augmentation

Longchao Da, Tiejin Chen, Lu Cheng, Hua Wei

PDF

Open Access

TL;DR

This paper introduces a novel method for quantifying LLM uncertainty using a directional entailment graph and eigenvalue analysis, along with an augmentation technique to improve response clarity, validated through extensive experiments.

Contribution

It presents a new approach to measure LLM uncertainty via directional graphs and eigenvalues, and proposes an augmentation method to reduce vagueness in responses.

Findings

01

The proposed uncertainty measure effectively captures directional instability.

02

The augmentation approach reduces vagueness in LLM responses.

03

Empirical results demonstrate the superiority of the methods over baselines.

Abstract

The Large language models (LLMs) have showcased superior capabilities in sophisticated tasks across various domains, stemming from basic question-answer (QA), they are nowadays used as decision assistants or explainers for unfamiliar content. However, they are not always correct due to the data sparsity in specific domain corpus, or the model's hallucination problems. Given this, how much should we trust the responses from LLMs? This paper presents a novel way to evaluate the uncertainty that captures the directional instability, by constructing a directional graph from entailment probabilities, and we innovatively conduct Random Walk Laplacian given the asymmetric property of a constructed directed graph, then the uncertainty is aggregated by the derived eigenvalues from the Laplacian process. We also provide a way to incorporate the existing work's semantics uncertainty with our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Safety Analysis · Fault Detection and Control Systems

MethodsSparse Evolutionary Training