Neural Proto-Language Reconstruction

Chenxuan Cui; Ying Chen; Qinxin Wang; David R. Mortensen

arXiv:2404.15690·cs.CL·April 25, 2024·2 cites

Neural Proto-Language Reconstruction

Chenxuan Cui, Ying Chen, Qinxin Wang, David R. Mortensen

PDF

Open Access

TL;DR

This paper explores advanced neural network models, including Transformers with VAE and data augmentation, to improve the automation and accuracy of proto-language reconstruction, a traditionally manual linguistic task.

Contribution

It introduces novel neural approaches combining VAE and data augmentation to enhance proto-language reconstruction accuracy and training stability.

Findings

01

VAE-augmented Transformer outperforms previous models on WikiHan dataset.

02

Data augmentation stabilizes training process.

03

Neural machine translation models are effective for reconstruction tasks.

Abstract

Proto-form reconstruction has been a painstaking process for linguists. Recently, computational models such as RNN and Transformers have been proposed to automate this process. We take three different approaches to improve upon previous methods, including data augmentation to recover missing reflexes, adding a VAE structure to the Transformer model for proto-to-language prediction, and using a neural machine translation model for the reconstruction task. We find that with the additional VAE structure, the Transformer model has a better performance on the WikiHan dataset, and the data augmentation step stabilizes the training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and cultural evolution

MethodsAttention Is All You Need · Dropout · Dense Connections · Label Smoothing · Residual Connection · Softmax · Position-Wise Feed-Forward Layer · Linear Layer · Byte Pair Encoding · Absolute Position Encodings