Subdiffusive semantic evolution in Indo-European languages
Bogd\'an Asztalos, Gergely Palla, D\'aniel Cz\'egel

TL;DR
This study reveals that semantic change in Indo-European languages follows a universal subdiffusive pattern, indicating constrained and anomalously slow evolution of word meanings over time.
Contribution
The paper demonstrates that semantic evolution is universally subdiffusive across five major Indo-European languages, using a novel diachronic semantic embedding pipeline and stochastic modeling.
Findings
Semantic change follows a subdiffusive pattern with an exponent of about 0.45.
Preserving temporal correlations is essential to observe subdiffusive behavior.
Subdiffusion remains robust under various data analysis methods.
Abstract
How do words change their meaning? Although semantic evolution is driven by a variety of distinct factors, including linguistic, societal, and technological ones, we find that there is one law that holds universally across five major Indo-European languages: that semantic evolution is strongly subdiffusive. Using an automated pipeline of diachronic distributional semantic embedding that controls for underlying symmetries, we show that words follow stochastic trajectories in meaning space with an anomalous diffusion exponent across languages, in contrast with diffusing particles that follow . Randomization methods indicate that preserving temporal correlations in semantic change directions is necessary to recover strongly subdiffusive behavior; however, correlations in change sizes play an important role too. We furthermore show that strong subdiffusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Opinion Dynamics and Social Influence · Complex Systems and Time Series Analysis
MethodsDiffusion
