Are Automatic Methods for Cognate Detection Good Enough for Phylogenetic   Reconstruction in Historical Linguistics?

Taraka Rama; Johann-Mattis List; Johannes Wahle; Gerhard J\"ager

arXiv:1804.05416·cs.CL·April 17, 2018

Are Automatic Methods for Cognate Detection Good Enough for Phylogenetic Reconstruction in Historical Linguistics?

Taraka Rama, Johann-Mattis List, Johannes Wahle, Gerhard J\"ager

PDF

1 Repo

TL;DR

This paper assesses how well automatic cognate detection algorithms perform in phylogenetic reconstruction of language families compared to traditional manual methods, finding they are a promising tool for broad linguistic analyses.

Contribution

It provides an empirical evaluation of automatic cognate detection methods' effectiveness in phylogenetic inference, highlighting their potential as a practical alternative to manual annotation.

Findings

01

Automated cognate sets produce phylogenies close to expert-annotated ones.

02

Automatic methods are especially useful for large-scale or exploratory studies.

03

Manual annotations still yield slightly more accurate phylogenies.

Abstract

We evaluate the performance of state-of-the-art algorithms for automatic cognate detection by comparing how useful automatically inferred cognates are for the task of phylogenetic inference compared to classical manually annotated cognate sets. Our findings suggest that phylogenies inferred from automated cognate sets come close to phylogenies inferred from expert-annotated ones, although on average, the latter are still superior. We conclude that future work on phylogenetic reconstruction can profit much from automatic cognate detection. Especially where scholars are merely interested in exploring the bigger picture of a language family's phylogeny, algorithms for automatic cognate detection are a useful complement for current research on language phylogenies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PhyloStar/AutoCogPhylo
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.