Evolutionary forces in language change
Christopher A. Ahern, Mitchell G. Newberry, Robin Clark, Joshua B., Plotkin

TL;DR
This paper applies population genetics methods to analyze language change, distinguishing between stochastic drift and selective forces, and finds that both mechanisms influence linguistic evolution with their effects varying by word frequency.
Contribution
It introduces a rigorous inference framework to differentiate drift from selection in language change, providing empirical evidence and a null model for linguistic evolution.
Findings
Drift's strength varies with word frequency.
Some language changes are driven by selection, others by drift.
Stochasticity plays a significant role in language evolution.
Abstract
Languages and genes are both transmitted from generation to generation, with opportunity for differential reproduction and survivorship of forms. Here we apply a rigorous inference framework, drawn from population genetics, to distinguish between two broad mechanisms of language change: drift and selection. Drift is change that results from stochasticity in transmission and it may occur in the absence of any intrinsic difference between linguistic forms; whereas selection is truly an evolutionary force arising from intrinsic differences -- for example, when one form is preferred by members of the population. Using large corpora of parsed texts spanning the 12th century to the 21st century, we analyze three examples of grammatical changes in English: the regularization of past-tense verbs, the rise of the periphrastic `do', and syntactic variation in verbal negation. We show that we can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
