Language Distribution Prediction based on Batch Markov Monte Carlo Simulation with Migration
XingYu Fu, ZiYi Yang, XiuWen Duan

TL;DR
This paper introduces a simulation method, Batch Markov Monte Carlo Simulation with Migration (BMMCSM), that models language spread while accounting for migration, population dynamics, and geographic distribution, and it uses machine learning to estimate the transition probabilities.
Contribution
The paper presents the BMMCSM algorithm for simulating language spread, incorporating migration, mortality, and fertility, and introduces a machine learning approach to estimate transition matrices.
Findings
Simulation results match real-world cultural and economic trends.
The Random Forest approach effectively predicts unknown transition probabilities.
Language distribution varies over time in line with global development trends.
Abstract
Language spreading is a complex mechanism that involves factors such as culture, economics, migration, and population. In this paper, we propose a set of methods to model the dynamics of the spreading system. To model the randomness of language spreading, we propose the Batch Markov Monte Carlo Simulation with Migration (BMMCSM) algorithm, in which each agent is treated as a language stack. The agent learns languages and migrates based on the proposed Batch Markov Property, according to the transition matrix T and the migration matrix M. Since population plays a crucial role in language spreading, we also introduce the Mortality and Fertility Mechanism, which controls the birth and death of the simulated agents, into the BMMCSM algorithm. The simulation results of BMMCSM show that the numerical and geographic distribution of languages varies over time. The change of distribution fits the…
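The abstract does not spell out the Batch Markov Property or the exact update rules, so the following is only a minimal sketch of an agent-based simulation in the same spirit, not the authors' implementation. It assumes that the next language an agent learns depends only on the language at the top of its stack, that all agents are updated together once per time step, and that a newborn inherits its parent's first language. The language set, region set, matrix entries, and the rates `DEATH_RATE` and `BIRTH_RATE` are hypothetical values chosen for illustration.

```python
import random

LANGUAGES = ["en", "zh", "es"]   # hypothetical language set
REGIONS = ["A", "B"]             # hypothetical regions

# T[i][j]: assumed probability that an agent whose most recently learned
# language is i picks language j next (j == i means nothing new is learned).
T = {
    "en": {"en": 0.8, "zh": 0.1, "es": 0.1},
    "zh": {"en": 0.3, "zh": 0.6, "es": 0.1},
    "es": {"en": 0.3, "zh": 0.1, "es": 0.6},
}
# M[r][s]: assumed probability that an agent in region r moves to region s.
M = {
    "A": {"A": 0.9, "B": 0.1},
    "B": {"A": 0.2, "B": 0.8},
}
DEATH_RATE = 0.02   # assumed per-step mortality
BIRTH_RATE = 0.03   # assumed per-step fertility


def sample(row, rng):
    """Draw one key from a {outcome: probability} row."""
    keys = list(row)
    return rng.choices(keys, weights=[row[k] for k in keys])[0]


def step(agents, rng):
    """Advance every agent by one time step and return the new population."""
    next_agents = []
    for stack, region in agents:
        # Mortality: the agent is dropped from the population.
        if rng.random() < DEATH_RATE:
            continue
        # Learning: sample from the row of the top-of-stack language.
        new_lang = sample(T[stack[-1]], rng)
        if new_lang not in stack:
            stack = stack + [new_lang]
        # Migration: sample the next region from the migration matrix.
        region = sample(M[region], rng)
        next_agents.append((stack, region))
        # Fertility: a newborn starts with the parent's first language.
        if rng.random() < BIRTH_RATE:
            next_agents.append(([stack[0]], region))
    return next_agents


def simulate(n_agents=1000, n_steps=20, seed=0):
    """Run the simulation from a uniformly random initial population."""
    rng = random.Random(seed)
    agents = [([rng.choice(LANGUAGES)], rng.choice(REGIONS))
              for _ in range(n_agents)]
    for _ in range(n_steps):
        agents = step(agents, rng)
    return agents


def speaker_counts(agents):
    """Count how many agents have each language somewhere in their stack."""
    counts = {lang: 0 for lang in LANGUAGES}
    for stack, _ in agents:
        for lang in stack:
            counts[lang] += 1
    return counts
```

Calling `speaker_counts(simulate())` gives the number of speakers per language after 20 steps under these assumed matrices. Averaging such counts over many seeds would give a Monte Carlo estimate of the distribution, and grouping by region instead of language would give the geographic view.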
Taxonomy
Topics: Algorithms and Data Compression · Natural Language Processing Techniques · Human Mobility and Location-Based Analysis
