Correlation Does Not Imply Compensation: Complexity and Irregularity in the Lexicon
Amanda Doucette, Ryan Cotterell, Morgan Sonderegger, Timothy J., O'Donnell

TL;DR
This study investigates the complex relationships between morphological irregularity, phonotactic complexity, word length, and frequency across multiple languages, revealing nuanced and sometimes contradictory patterns that challenge prior assumptions.
Contribution
It provides a comprehensive analysis of these relationships across 25 languages using advanced information-theoretic measures, highlighting variability and robustness issues in previous findings.
Findings
Positive relationship between irregularity and complexity within languages
Weak negative relationship between word length and irregularity
Some previous correlations are less robust than previously claimed
Abstract
It has been claimed that within a language, morphologically irregular words are more likely to be phonotactically simple and morphologically regular words are more likely to be phonotactically complex. This inverse correlation has been demonstrated in English for a small sample of words, but has yet to be shown for a larger sample of languages. Furthermore, frequency and word length are known to influence both phonotactic complexity and morphological irregularity, and they may be confounding factors in this relationship. Therefore, we examine the relationships between all pairs of these four variables both to assess the robustness of previous findings using improved methodology and as a step towards understanding the underlying causal relationship. Using information-theoretic measures of phonotactic complexity and morphological irregularity (Pimentel et al., 2020; Wu et al., 2019) on 25…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLexicography and Language Studies
