Contrastive Identification and Generation in the Limit
Xiaoyu Li, Andi Han, Jiaojiao Jiang, Junbin Gao

TL;DR
This paper explores contrastive identification and generation in the limit, analyzing how learners infer hypotheses from unordered pairs with hidden labels, introducing new theoretical concepts and characterizations.
Contribution
It provides the first formal study of contrastive learning in the limit, including characterizations, dimensions, and hierarchy results, especially under adversarial corruption.
Findings
Characterized contrastive identifiable classes with a geometric condition.
Defined contrastive closure dimension and analyzed uniform contrastive generation.
Proved a hierarchy and sharp reversal results under adversarial corruption.
Abstract
In the classical identification in the limit model of Gold [1967], a stream of positive examples is presented round by round, and the learner must eventually recover the target hypothesis. Recently, Kleinberg and Mullainathan [2024] introduced generation in the limit, where the learner instead must eventually output novel elements of the target's support. Both lines of work focus on positive-only or fully labeled data. Yet many natural supervision signals are inherently relational rather than singleton, which encode relationships between examples rather than labels of individual ones. We initiate the study of contrastive identification and generation in the limit, where the learner observes a contrastive presentation of data: a stream of unordered pairs satisfying for an unknown target binary hypothesis , but which element is positive is hidden from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
