A Theory of Time-Sensitive Language Generation: Sparse Hallucination Beats Mode Collapse
Atul Ganju, Travis McVoy, Shaddin Dughmi, Shang-Hua Teng

TL;DR
This paper introduces a theoretical framework for time-sensitive language generation, demonstrating that hallucination can be minimized over time and identifying conditions for successful generation under deadlines.
Contribution
It formalizes the concept of time-sensitive language generation and proves when hallucination can be reduced or eliminated based on consistency and deadline functions.
Findings
Timely generation is impossible for strongly consistent generators.
A vanishing hallucination rate allows optimal density with superlinear deadlines.
Linear deadlines with vanishing hallucination rate make timely generation impossible.
Abstract
We study language generation in the limit under a global preference ordering on strings, as introduced by Kleinberg and Wei. As is done in previous work, we aim for breadth, but impose an additional requirement of timeliness: higher-ranked strings should be generated earlier. A string is then only credited if it is generated before a deadline, where its deadline is defined by a function that maps a string's rank in the target language to the time by which it must be produced. This is in keeping with a central consideration in machine learning, where inductive bias favors ``simpler'' or ``more plausible'' outputs, all else being equal. We show that timely generation is impossible in a strong sense for eventually consistent generators -- the protagonists of most prior related work. Under what is perhaps the mildest natural relaxation of consistency, a hallucination rate that vanishes over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
