Locally Measuring Cross-lingual Lexical Alignment: A Domain and Word   Level Perspective

Taelin Karidi; Eitan Grossman; Omri Abend

arXiv:2410.07239·cs.CL·October 11, 2024

Locally Measuring Cross-lingual Lexical Alignment: A Domain and Word Level Perspective

Taelin Karidi, Eitan Grossman, Omri Abend

PDF

Open Access 1 Video

TL;DR

This paper introduces a local, domain-specific approach to cross-lingual lexical alignment, utilizing new metrics and validation methods to assess how well translation equivalents share meaning across diverse languages.

Contribution

It presents a novel methodology and metrics for evaluating lexical alignment at the word and domain level, incorporating naturalistic validation and analysis across 16 languages.

Findings

01

Significant room for improvement with newer language models.

02

New metrics based on contextualized embeddings show promise.

03

Analysis highlights the importance of local and domain-specific alignment evaluation.

Abstract

NLP research on aligning lexical representation spaces to one another has so far focused on aligning language spaces in their entirety. However, cognitive science has long focused on a local perspective, investigating whether translation equivalents truly share the same meaning or the extent that cultural and regional influences result in meaning variations. With recent technological advances and the increasing amounts of available data, the longstanding question of cross-lingual lexical alignment can now be approached in a more data-driven manner. However, developing metrics for the task requires some methodology for comparing metric efficacy. We address this gap and present a methodology for analyzing both synthetic validations and a novel naturalistic validation using lexical gaps in the kinship domain. We further propose new metrics, hitherto unexplored on this task, based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Locally Measuring Cross-lingual Lexical Alignment: A Domain and Word Level Perspective· underline

Taxonomy

TopicsNatural Language Processing Techniques