Evaluating Models of Robust Word Recognition with Serial Reproduction
Stephan C. Meylan, Sathvik Nair, Thomas L. Griffiths

TL;DR
This study assesses how well different probabilistic language models predict human expectations in noisy spoken communication, finding models with abstract phrase structure representations most accurately reflect how people reproduce and modify utterances.
Contribution
It introduces serial reproduction as a method to evaluate language models against human spoken language expectations, highlighting the importance of abstract syntactic representations.
Findings
Models with phrase structure best predict utterance modifications.
Serial reproduction effectively reveals linguistic expectations.
Abstract representations improve model-human alignment.
Abstract
Spoken communication occurs in a "noisy channel" characterized by high levels of environmental noise, variability within and between speakers, and lexical and syntactic ambiguity. Given these properties of the received linguistic input, robust spoken word recognition -- and language processing more generally -- relies heavily on listeners' prior knowledge to evaluate whether candidate interpretations of that input are more or less likely. Here we compare several broad-coverage probabilistic generative language models in their ability to capture human linguistic expectations. Serial reproduction, an experimental paradigm where spoken utterances are reproduced by successive participants similar to the children's game of "Telephone," is used to elicit a sample that reflects the linguistic expectations of English-speaking adults. When we evaluate a suite of probabilistic generative language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLogistic Regression
