Evaluating Models of Robust Word Recognition with Serial Reproduction

Stephan C. Meylan; Sathvik Nair; Thomas L. Griffiths

arXiv:2101.09788·cs.CL·January 26, 2021

Evaluating Models of Robust Word Recognition with Serial Reproduction

Stephan C. Meylan, Sathvik Nair, Thomas L. Griffiths

PDF

TL;DR

This study assesses how well different probabilistic language models predict human expectations in noisy spoken communication, finding models with abstract phrase structure representations most accurately reflect how people reproduce and modify utterances.

Contribution

It introduces serial reproduction as a method to evaluate language models against human spoken language expectations, highlighting the importance of abstract syntactic representations.

Findings

01

Models with phrase structure best predict utterance modifications.

02

Serial reproduction effectively reveals linguistic expectations.

03

Abstract representations improve model-human alignment.

Abstract

Spoken communication occurs in a "noisy channel" characterized by high levels of environmental noise, variability within and between speakers, and lexical and syntactic ambiguity. Given these properties of the received linguistic input, robust spoken word recognition -- and language processing more generally -- relies heavily on listeners' prior knowledge to evaluate whether candidate interpretations of that input are more or less likely. Here we compare several broad-coverage probabilistic generative language models in their ability to capture human linguistic expectations. Serial reproduction, an experimental paradigm where spoken utterances are reproduced by successive participants similar to the children's game of "Telephone," is used to elicit a sample that reflects the linguistic expectations of English-speaking adults. When we evaluate a suite of probabilistic generative language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLogistic Regression