Quantitative methods for Phylogenetic Inference in Historical Linguistics: An experimental case study of South Central Dravidian
Taraka Rama, Sudheer Kolachina, Lakshmi Bai B

TL;DR
This study evaluates the effectiveness of genetic-inspired quantitative algorithms in reconstructing language family trees, demonstrating their general agreement with traditional linguistic methods and highlighting their potential in historical linguistics.
Contribution
It applies distance and discrete character methods from genetics to linguistic data, showing their utility in phylogenetic inference for languages.
Findings
Phylogenetic trees largely agree with traditional linguistic trees
Minor differences are due to genuine ambiguity in data
Quantitative methods are useful for predicting language relationships
Abstract
In this paper we examine the usefulness of two classes of algorithms Distance Methods, Discrete Character Methods (Felsenstein and Felsenstein 2003) widely used in genetics, for predicting the family relationships among a set of related languages and therefore, diachronic language change. Applying these algorithms to the data on the numbers of shared cognates- with-change and changed as well as unchanged cognates for a group of six languages belonging to a Dravidian language sub-family given in Krishnamurti et al. (1983), we observed that the resultant phylogenetic trees are largely in agreement with the linguistic family tree constructed using the comparative method of reconstruction with only a few minor differences. Furthermore, we studied these minor differences and found that they were cases of genuine ambiguity even for a well-trained historical linguist. We evaluated the trees…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Natural Language Processing Techniques · Linguistic Variation and Morphology
