Complexity counts: global and local perspectives on Indo-Aryan numeral systems
Chundra Cathcart

TL;DR
This paper analyzes the unusual complexity of Indo-Aryan numeral systems, quantifies their intricacy using cross-linguistic metrics, and explores cultural factors influencing their persistence and variation.
Contribution
It introduces new metrics for measuring numeral system complexity and provides a cross-linguistic analysis of Indo-Aryan languages' numeral systems and their underlying factors.
Findings
Indo-Aryan languages have more complex numeral systems than most languages.
Individual Indo-Aryan languages vary significantly in their numeral complexity.
Despite high complexity, Indo-Aryan systems follow general efficiency pressures.
Abstract
The numeral systems of Indo-Aryan languages such as Hindi, Gujarati, and Bengali are highly unusual in that unlike most numeral systems (e.g., those of English, Chinese, etc.), forms referring to 1--99 are highly non-transparent and cannot be constructed using straightforward rules for forming combinations of tens and digits. As an example, Hindi/Urdu {\it iky\=anve} `91' is not decomposable into the composite elements {\it ek} `one' and {\it nave} `ninety' in the way that its English counterpart is. This paper further clarifies the position of Indo-Aryan languages within the typology of numeral systems, and explores the linguistic and non-linguistic factors that may be responsible for the persistence of complex systems in these languages. Using data from multiple databases, we develop and employ a number of cross-linguistically applicable metrics to quantify the complexity of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
