LLM Uncertainty Quantification through Directional Entailment Graph and Claim Level Response Augmentation
Longchao Da, Tiejin Chen, Lu Cheng, Hua Wei

TL;DR
This paper introduces a novel method for quantifying LLM uncertainty using a directional entailment graph and eigenvalue analysis, along with an augmentation technique to improve response clarity, validated through extensive experiments.
Contribution
It presents a new approach to measure LLM uncertainty via directional graphs and eigenvalues, and proposes an augmentation method to reduce vagueness in responses.
Findings
The proposed uncertainty measure effectively captures directional instability.
The augmentation approach reduces vagueness in LLM responses.
Empirical results demonstrate the superiority of the methods over baselines.
Abstract
The Large language models (LLMs) have showcased superior capabilities in sophisticated tasks across various domains, stemming from basic question-answer (QA), they are nowadays used as decision assistants or explainers for unfamiliar content. However, they are not always correct due to the data sparsity in specific domain corpus, or the model's hallucination problems. Given this, how much should we trust the responses from LLMs? This paper presents a novel way to evaluate the uncertainty that captures the directional instability, by constructing a directional graph from entailment probabilities, and we innovatively conduct Random Walk Laplacian given the asymmetric property of a constructed directed graph, then the uncertainty is aggregated by the derived eigenvalues from the Laplacian process. We also provide a way to incorporate the existing work's semantics uncertainty with our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Safety Analysis · Fault Detection and Control Systems
MethodsSparse Evolutionary Training
