Recursive numeral systems are highly regular and easy to process
Ponrawee Prasertsom, Andrea Silvi, Jennifer Culbertson, Moa Johansson, Devdatt Dubhashi, Kenny Smith

TL;DR
This paper demonstrates that recursive numeral systems are highly regular and efficiently processed, with regularity explaining their natural shape and distinguishing them from less natural systems, using an MDL-based approach.
Contribution
It introduces an MDL-based framework to quantify regularity and processing efficiency in recursive numeral systems, explaining natural language patterns without ad-hoc constraints.
Findings
Recursive systems are more regular and efficient according to MDL measures.
Natural systems differ from theoretical ones mainly in regularity and processing complexity.
Regularity naturally constrains the shape of recursive numeral systems.
Abstract
Much recent work has shown how cross-linguistic variation is constrained by competing pressures from efficient communication. However, little attention has been paid to the role of the systematicity of forms (regularity), a key property of natural language. Here, we demonstrate the importance of regularity in explaining the shape of linguistic systems by looking at recursive numeral systems. Previous work has argued that these systems optimise the trade-off between lexicon size and average morphosyntatic complexity (Deni\'c and Szymanik, 2024). However, showing that only natural-language-like systems optimise this trade-off has proven elusive, and existing solutions rely on ad-hoc constraints to rule out unnatural systems (Yang and Regier, 2025). Drawing on the Minimum Description Length (MDL) approach, we argue that recursive numeral systems are better viewed as efficient with regard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsLanguage and cultural evolution · Language Development and Disorders · Syntax, Semantics, Linguistic Variation
