Contrastive Identification and Generation in the Limit

Xiaoyu Li; Andi Han; Jiaojiao Jiang; Junbin Gao

arXiv:2605.06211·cs.LG·May 8, 2026

Contrastive Identification and Generation in the Limit

Xiaoyu Li, Andi Han, Jiaojiao Jiang, Junbin Gao

PDF

TL;DR

This paper explores contrastive identification and generation in the limit, analyzing how learners infer hypotheses from unordered pairs with hidden labels, introducing new theoretical concepts and characterizations.

Contribution

It provides the first formal study of contrastive learning in the limit, including characterizations, dimensions, and hierarchy results, especially under adversarial corruption.

Findings

01

Characterized contrastive identifiable classes with a geometric condition.

02

Defined contrastive closure dimension and analyzed uniform contrastive generation.

03

Proved a hierarchy and sharp reversal results under adversarial corruption.

Abstract

In the classical identification in the limit model of Gold [1967], a stream of positive examples is presented round by round, and the learner must eventually recover the target hypothesis. Recently, Kleinberg and Mullainathan [2024] introduced generation in the limit, where the learner instead must eventually output novel elements of the target's support. Both lines of work focus on positive-only or fully labeled data. Yet many natural supervision signals are inherently relational rather than singleton, which encode relationships between examples rather than labels of individual ones. We initiate the study of contrastive identification and generation in the limit, where the learner observes a contrastive presentation of data: a stream of unordered pairs ${x, y}$ satisfying $h (x) \neq = h (y)$ for an unknown target binary hypothesis $h$ , but which element is positive is hidden from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.