LoGU: Long-form Generation with Uncertainty Expressions

Ruihan Yang; Caiqi Zhang; Zhisong Zhang; Xinting Huang; Sen Yang; Nigel Collier; Dong Yu; Deqing Yang

arXiv:2410.14309·cs.CL·June 5, 2025

LoGU: Long-form Generation with Uncertainty Expressions

Ruihan Yang, Caiqi Zhang, Zhisong Zhang, Xinting Huang, Sen Yang, Nigel Collier, Dong Yu, Deqing Yang

PDF

Open Access 1 Repo 2 Models 1 Video

TL;DR

This paper introduces LoGU, a method for improving long-form generation by enabling models to better express and calibrate uncertainty, thereby reducing hallucinations and increasing factual accuracy in responses.

Contribution

The paper presents a novel framework and training pipeline for long-form generation with uncertainty expression, addressing challenges of suppression and misalignment.

Findings

01

Significantly reduces hallucinations in long-form responses

02

Improves factual accuracy and response comprehensiveness

03

Enhances uncertainty calibration in language models

Abstract

While Large Language Models (LLMs) demonstrate impressive capabilities, they still struggle with generating factually incorrect content (i.e., hallucinations). A promising approach to mitigate this issue is enabling models to express uncertainty when unsure. Previous research on uncertainty modeling has primarily focused on short-form QA, but realworld applications often require much longer responses. In this work, we introduce the task of Long-form Generation with Uncertainty(LoGU). We identify two key challenges: Uncertainty Suppression, where models hesitate to express uncertainty, and Uncertainty Misalignment, where models convey uncertainty inaccurately. To tackle these challenges, we propose a refinement-based data collection framework and a two-stage training pipeline. Our framework adopts a divide-and-conquer strategy, refining uncertainty based on atomic claims. The collected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rhyang2021/logu
noneOfficial

Models

Videos

LoGU: Long-form Generation with Uncertainty Expressions· underline

Taxonomy

TopicsModel-Driven Software Engineering Techniques