Graph-based Uncertainty Metrics for Long-form Language Model Outputs

Mingjian Jiang; Yangjun Ruan; Prasanna Sattigeri; Salim; Roukos; Tatsunori Hashimoto

arXiv:2410.20783·cs.CL·October 29, 2024

Graph-based Uncertainty Metrics for Long-form Language Model Outputs

Mingjian Jiang, Yangjun Ruan, Prasanna Sattigeri, Salim, Roukos, Tatsunori Hashimoto

PDF

Open Access 1 Repo

TL;DR

This paper introduces Graph Uncertainty, a novel graph-based method for estimating claim-level uncertainty in long-form language model outputs, improving factuality and informativeness.

Contribution

It proposes a new graph-based framework for granular uncertainty estimation and develops uncertainty-aware decoding techniques for better long-form text generation.

Findings

01

6.8% average relative gain in AUPRC for uncertainty estimation

02

2-4% improvement in factuality of generated responses

03

Enhanced informativeness of long-form outputs

Abstract

Recent advancements in Large Language Models (LLMs) have significantly improved text generation capabilities, but these systems are still known to hallucinate, and granular uncertainty estimation for long-form LLM generations remains challenging. In this work, we propose Graph Uncertainty -- which represents the relationship between LLM generations and claims within them as a bipartite graph and estimates the claim-level uncertainty with a family of graph centrality metrics. Under this view, existing uncertainty estimation methods based on the concept of self-consistency can be viewed as using degree centrality as an uncertainty measure, and we show that more sophisticated alternatives such as closeness centrality provide consistent gains at claim-level uncertainty estimation. Moreover, we present uncertainty-aware decoding techniques that leverage both the graph structure and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mingjianjiang-1/graph-based-uncertainty
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Semantic Web and Ontologies · Natural Language Processing Techniques