Can a Neural Model Guide Fieldwork? A Case Study on Morphological Data   Collection

Aso Mahmudi; Borja Herce; Demian Inostroza Amestica; Andreas; Scherbakov; Eduard Hovy; Ekaterina Vylomova

arXiv:2409.14628·cs.CL·December 17, 2024

Can a Neural Model Guide Fieldwork? A Case Study on Morphological Data Collection

Aso Mahmudi, Borja Herce, Demian Inostroza Amestica, Andreas, Scherbakov, Eduard Hovy, Ekaterina Vylomova

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural model to assist linguists in morphological data collection during fieldwork, improving efficiency through strategic sampling and model confidence guidance, thereby streamlining language documentation efforts.

Contribution

It presents a novel framework for evaluating sampling strategies and neural model effectiveness in morphological data collection, integrating linguist-model interaction dynamics.

Findings

01

Uniform sampling increases data diversity.

02

Model confidence guides more effective annotation.

03

Enhanced interaction improves morphological data collection efficiency.

Abstract

Linguistic fieldwork is an important component in language documentation and preservation. However, it is a long, exhaustive, and time-consuming process. This paper presents a novel model that guides a linguist during the fieldwork and accounts for the dynamics of linguist-speaker interactions. We introduce a novel framework that evaluates the efficiency of various sampling strategies for obtaining morphological data and assesses the effectiveness of state-of-the-art neural models in generalising morphological structures. Our experiments highlight two key strategies for improving the efficiency: (1) increasing the diversity of annotated data by uniform sampling among the cells of the paradigm tables, and (2) using model confidence as a guide to enhance positive interaction by providing reliable predictions during annotation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Aso-UniMelb/neural-fieldwork-guide
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications