Mistake-Bounded Language Generation

Jon Kleinberg; Charlotte Peale; Omer Reingold

arXiv:2605.10809·cs.LG·May 12, 2026

Mistake-Bounded Language Generation

Jon Kleinberg, Charlotte Peale, Omer Reingold

PDF

TL;DR

This paper introduces a new mistake-bounded framework for language generation, focusing on minimizing total errors during learning rather than just eventual correctness, with algorithms and bounds for finite and infinite classes.

Contribution

It develops a formal reduction to a known learning framework, providing mistake bounds and trade-offs for finite and infinite language classes, including extensions to noisy adversaries.

Findings

01

Algorithm achieves optimal last-mistake time for finite classes.

02

Mistake bound of logarithmic order for finite classes.

03

Trade-off between mistake bounds and convergence in infinite classes.

Abstract

We investigate the learning task of language generation in the limit, but shift focus from the traditional time-of-last-mistake metric of a generator's success to a new notion of "mistake-bounded generation." While existing results for language generation in the limit focus on guaranteeing eventual consistency, they are blind to the cumulative error incurred during the learning process. We address this by shifting the goal to minimizing the total number of invalid elements output by a generation algorithm. We establish a formal reduction to the Learning from Correct Demonstrations framework of Joshi et al. (2025), enabling a general recipe for deriving mistake bounds via weighted update rules. For finite classes, we provide an algorithm that simultaneously achieves an optimal last-mistake time of $Cdim (L)$ and a mistake bound of $⌊ lo g_{2} ∣ L ∣ ⌋$ , whereas for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.