On Union-Closedness of Language Generation

Steve Hanneke; Amin Karbasi; Anay Mehrotra; Grigoris Velegkas

arXiv:2506.18642·cs.LG·June 24, 2025

On Union-Closedness of Language Generation

Steve Hanneke, Amin Karbasi, Anay Mehrotra, Grigoris Velegkas

PDF

1 Video

TL;DR

This paper explores the limitations of language generation models in the limit, proving that certain classes cannot be combined or boosted, and addressing open questions about the structure and properties of generatable classes.

Contribution

It resolves open questions by showing finite unions of generatable classes may not be generatable and introduces a novel diagonalization method for language generation.

Findings

01

Finite unions of generatable classes need not be generatable.

02

There exist classes that are non-uniformly generatable but whose union is non-generatable.

03

A new diagonalization approach addresses open questions about uncountable classes and closure properties.

Abstract

We investigate language generation in the limit - a model by Kleinberg and Mullainathan [NeurIPS 2024] and extended by Li, Raman, and Tewari [COLT 2025]. While Kleinberg and Mullainathan proved generation is possible for all countable collections, Li et al. defined a hierarchy of generation notions (uniform, non-uniform, and generatable) and explored their feasibility for uncountable collections. Our first set of results resolve two open questions of Li et al. by proving finite unions of generatable or non-uniformly generatable classes need not be generatable. These follow from a stronger result: there is a non-uniformly generatable class and a uniformly generatable class whose union is non-generatable. This adds to the aspects along which language generation in the limit is different from traditional tasks in statistical learning theory like classification, which are closed under…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On Union-Closedness of Language Generation· slideslive