A Characterization of List Language Identification in the Limit
Moses Charikar, Chirag Pabbaraju, Ambuj Tewari

TL;DR
This paper characterizes when collections of languages can be identified in the limit with a list of size k, extending classical results and providing exact conditions and rate analyses for list identification.
Contribution
It offers an exact characterization of k-list learnability in the limit, generalizing Angluin's classic results and analyzing the rates of identification in statistical settings.
Findings
Characterization of k-list identifiable collections based on decompositions.
Exponential rate of list identification for identifiable collections.
Non-zero rate impossibility for non-list identifiable collections.
Abstract
We study the problem of language identification in the limit, where given a sequence of examples from a target language, the goal of the learner is to output a sequence of guesses for the target language such that all the guesses beyond some finite time are correct. Classical results of Gold showed that language identification in the limit is impossible for essentially any interesting collection of languages. Later, Angluin gave a precise characterization of language collections for which this task is possible. Motivated by recent positive results for the related problem of language generation, we revisit the classic language identification problem in the setting where the learner is given the additional power of producing a list of guesses at each time step. The goal is to ensure that beyond some finite time, one of the guesses is correct at each time step. We give an exact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · semigroups and automata theory · Authorship Attribution and Profiling
