Precise Information Control in Long-Form Text Generation

Jacqueline He; Howard Yen; Margaret Li; Shuyue Stella Li; Zhiyuan Zeng; Weijia Shi; Yulia Tsvetkov; Danqi Chen; Pang Wei Koh; Luke Zettlemoyer

arXiv:2506.06589·cs.CL·October 2, 2025

Precise Information Control in Long-Form Text Generation

Jacqueline He, Howard Yen, Margaret Li, Shuyue Stella Li, Zhiyuan Zeng, Weijia Shi, Yulia Tsvetkov, Danqi Chen, Pang Wei Koh, Luke Zettlemoyer

PDF

Open Access 1 Video

TL;DR

This paper introduces Precise Information Control (PIC), a new task and benchmark for long-form text generation that emphasizes faithfulness to input claims, revealing current models' hallucination issues and proposing a training framework to improve factual accuracy.

Contribution

The paper formulates PIC as a new task for grounded long-form generation, creates PIC-Bench benchmark, and develops a weakly supervised training method to significantly enhance model faithfulness.

Findings

01

State-of-the-art LMs hallucinate over 70% of the time against input claims.

02

The PIC-LM improves F1 score from 69.1% to 91.0% in the full PIC setting.

03

PIC-LM enhances factual accuracy metrics in downstream tasks.

Abstract

A central challenge in language models (LMs) is faithfulness hallucination: the generation of information unsubstantiated by input context. To study this problem, we propose Precise Information Control (PIC), a new task formulation that requires models to generate long-form outputs grounded in a provided set of short self-contained statements, without adding any unsupported ones. PIC includes a full setting that tests a model's ability to include exactly all input claims, and a partial setting that requires the model to selectively incorporate only relevant claims. We present PIC-Bench, a benchmark of eight long-form generation tasks (e.g., summarization, biography generation) adapted to the PIC setting, where LMs are supplied with well-formed, verifiable input claims. Our evaluation of a range of open and proprietary LMs on PIC-Bench reveals that, surprisingly, state-of-the-art LMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Precise Information Control in Long-Form Text Generation· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications