The scarcity of crossing dependencies: a direct outcome of a specific constraint?
Carlos G\'omez-Rodr\'iguez, Ramon Ferrer-i-Cancho

TL;DR
This study shows that the infrequency of crossing dependencies in human languages can be explained by dependency length minimization rather than a specific constraint against crossings, using statistical analysis across multiple languages.
Contribution
It provides evidence that the scarcity of crossings is a side effect of dependency length minimization, not an independent linguistic constraint.
Findings
A simple predictor estimates crossing dependencies with less than 5% error.
Baseline random ordering predictor has at least 6 times higher error.
Crossing frequency is explained by dependency length minimization rather than a dedicated constraint.
Abstract
The structure of a sentence can be represented as a network where vertices are words and edges indicate syntactic dependencies. Interestingly, crossing syntactic dependencies have been observed to be infrequent in human languages. This leads to the question of whether the scarcity of crossings in languages arises from an independent and specific constraint on crossings. We provide statistical evidence suggesting that this is not the case, as the proportion of dependency crossings of sentences from a wide range of languages can be accurately estimated by a simple predictor based on a null hypothesis on the local probability that two dependencies cross given their lengths. The relative error of this predictor never exceeds 5% on average, whereas the error of a baseline predictor assuming a random ordering of the words of a sentence is at least 6 times greater. Our results suggest that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
