Transformed Protoform Reconstruction

Young Min Kim; Kalvin Chang; Chenxuan Cui; David Mortensen

arXiv:2307.01896·cs.CL·July 7, 2023

Transformed Protoform Reconstruction

Young Min Kim, Kalvin Chang, Chenxuan Cui, David Mortensen

PDF

Open Access 1 Repo

TL;DR

This paper enhances protoform reconstruction by applying a Transformer-based seq2seq model, outperforming previous RNN models on Latin and Chinese datasets, and explores phylogenetic signals in the model.

Contribution

It introduces a Transformer model for protoform reconstruction, achieving better performance than prior RNN-based models and analyzing phylogenetic signals.

Findings

01

Transformer outperforms RNN models on multiple datasets

02

Model achieves state-of-the-art metrics on Latin and Chinese datasets

03

Phylogenetic signals are detectable in the model

Abstract

Protoform reconstruction is the task of inferring what morphemes or words appeared like in the ancestral languages of a set of daughter languages. Meloni et al. (2021) achieved the state-of-the-art on Latin protoform reconstruction with an RNN-based encoder-decoder with attention model. We update their model with the state-of-the-art seq2seq model: the Transformer. Our model outperforms their model on a suite of different metrics on two different datasets: their Romance data of 8,000 cognates spanning 5 languages and a Chinese dataset (Hou 2004) of 800+ cognates spanning 39 varieties. We also probe our model for potential phylogenetic signal contained in the model. Our code is publicly available at https://github.com/cmu-llab/acl-2023.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cmu-llab/acl-2023
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Genomics and Phylogenetic Studies · Language and cultural evolution

MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Sigmoid Activation · Label Smoothing · Adam · Position-Wise Feed-Forward Layer