Graph-based Uncertainty Metrics for Long-form Language Model Outputs
Mingjian Jiang, Yangjun Ruan, Prasanna Sattigeri, Salim, Roukos, Tatsunori Hashimoto

TL;DR
This paper introduces Graph Uncertainty, a novel graph-based method for estimating claim-level uncertainty in long-form language model outputs, improving factuality and informativeness.
Contribution
It proposes a new graph-based framework for granular uncertainty estimation and develops uncertainty-aware decoding techniques for better long-form text generation.
Findings
6.8% average relative gain in AUPRC for uncertainty estimation
2-4% improvement in factuality of generated responses
Enhanced informativeness of long-form outputs
Abstract
Recent advancements in Large Language Models (LLMs) have significantly improved text generation capabilities, but these systems are still known to hallucinate, and granular uncertainty estimation for long-form LLM generations remains challenging. In this work, we propose Graph Uncertainty -- which represents the relationship between LLM generations and claims within them as a bipartite graph and estimates the claim-level uncertainty with a family of graph centrality metrics. Under this view, existing uncertainty estimation methods based on the concept of self-consistency can be viewed as using degree centrality as an uncertainty measure, and we show that more sophisticated alternatives such as closeness centrality provide consistent gains at claim-level uncertainty estimation. Moreover, we present uncertainty-aware decoding techniques that leverage both the graph structure and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Semantic Web and Ontologies · Natural Language Processing Techniques
