A Theory of Time-Sensitive Language Generation: Sparse Hallucination Beats Mode Collapse

Atul Ganju; Travis McVoy; Shaddin Dughmi; Shang-Hua Teng

arXiv:2605.11302·cs.LG·May 21, 2026

A Theory of Time-Sensitive Language Generation: Sparse Hallucination Beats Mode Collapse

Atul Ganju, Travis McVoy, Shaddin Dughmi, Shang-Hua Teng

PDF

TL;DR

This paper introduces a theoretical framework for time-sensitive language generation, demonstrating that hallucination can be minimized over time and identifying conditions for successful generation under deadlines.

Contribution

It formalizes the concept of time-sensitive language generation and proves when hallucination can be reduced or eliminated based on consistency and deadline functions.

Findings

01

Timely generation is impossible for strongly consistent generators.

02

A vanishing hallucination rate allows optimal density with superlinear deadlines.

03

Linear deadlines with vanishing hallucination rate make timely generation impossible.

Abstract

We study language generation in the limit under a global preference ordering on strings, as introduced by Kleinberg and Wei. As is done in previous work, we aim for breadth, but impose an additional requirement of timeliness: higher-ranked strings should be generated earlier. A string is then only credited if it is generated before a deadline, where its deadline is defined by a function that maps a string's rank in the target language to the time by which it must be produced. This is in keeping with a central consideration in machine learning, where inductive bias favors ``simpler'' or ``more plausible'' outputs, all else being equal. We show that timely generation is impossible in a strong sense for eventually consistent generators -- the protagonists of most prior related work. Under what is perhaps the mildest natural relaxation of consistency, a hallucination rate that vanishes over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.