Distribution-based Prediction of the Degree of Grammaticalization for German Prepositions
Dominik Schlechtweg, Sabine Schulte im Walde

TL;DR
This paper investigates how the degree of grammaticalization of German prepositions relates to corpus-based measures like entropy, frequency, and context types, finding that frequency and context diversity are better indicators than entropy.
Contribution
The study introduces a corpus-based approach to predict grammaticalization levels of German prepositions using quantitative measures, highlighting the importance of frequency and context diversity.
Findings
Frequency correlates strongly with grammaticalization degree.
Number of context types shows a strong correlation.
Entropy has a moderate correlation with grammaticalization.
Abstract
We test the hypothesis that the degree of grammaticalization of German prepositions correlates with their corpus-based contextual dispersion measured by word entropy. We find that there is indeed a moderate correlation for entropy, but a stronger correlation for frequency and number of context types.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
