The statistical trade-off between word order and word structure - large-scale evidence for the principle of least effort
Alexander Koplenig, Peter Meyer, Sascha Wolfer, and Carolin, Mueller-Spitzer

TL;DR
This study provides large-scale evidence that languages balance the use of word order and internal word structure to convey grammatical information, reflecting an efficient trade-off principle across diverse languages and texts.
Contribution
It offers the first large-scale quantitative analysis demonstrating a statistical trade-off between word order and word structure in over 1,100 languages using biblical translations.
Findings
Languages with more reliance on word order use less internal word structure.
A recurring trade-off exists across different biblical books in how information is encoded.
Languages encode information in diverse yet efficient ways.
Abstract
Languages employ different strategies to transmit structural and grammatical information. While, for example, grammatical dependency relationships in sentences are mainly conveyed by the ordering of the words for languages like Mandarin Chinese, or Vietnamese, the word ordering is much less restricted for languages such as Inupiatun or Quechua, as those languages (also) use the internal structure of words (e.g. inflectional morphology) to mark grammatical relationships in a sentence. Based on a quantitative analysis of more than 1,500 unique translations of different books of the Bible in more than 1,100 different languages that are spoken as a native language by approximately 6 billion people (more than 80% of the world population), we present large-scale evidence for a statistical trade-off between the amount of information conveyed by the ordering of words and the amount of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
