Learning Phonotactics from Linguistic Informants
Canaan Breiss, Alexis Ross, Amani Maina-Kilaas, Roger Levy, Jacob, Andreas

TL;DR
This paper introduces an interactive model that learns phonotactic rules by querying a linguistic informant, demonstrating high sample efficiency in both natural and generated languages.
Contribution
It presents a novel interactive learning framework that uses information-theoretic policies to efficiently learn phonotactic constraints from linguistic informants.
Findings
Achieves comparable or better sample efficiency than fully supervised methods.
Effective in both natural and procedurally-generated languages.
Demonstrates the utility of interactive, informant-based learning for phonotactics.
Abstract
We propose an interactive approach to language learning that utilizes linguistic acceptability judgments from an informant (a competent language user) to learn a grammar. Given a grammar formalism and a framework for synthesizing data, our model iteratively selects or synthesizes a data-point according to one of a range of information-theoretic policies, asks the informant for a binary judgment, and updates its own parameters in preparation for the next query. We demonstrate the effectiveness of our model in the domain of phonotactics, the rules governing what kinds of sound-sequences are acceptable in a language, and carry out two experiments, one with typologically-natural linguistic data and another with a range of procedurally-generated languages. We find that the information-theoretic policies that our model uses to select items to query the informant achieve sample efficiency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems
