Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment

Siyu Lai; Zhen Yang; Fandong Meng; Yufeng Chen; Jinan Xu; Jie Zhou

arXiv:2210.04141·cs.CL·October 11, 2022

Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment

Siyu Lai, Zhen Yang, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou

PDF

Open Access 1 Repo

TL;DR

Cross-Align introduces a deep interaction modeling approach for word alignment using cross-attention mechanisms and a two-stage training process, significantly improving alignment accuracy across multiple language pairs.

Contribution

The paper presents a novel deep interaction model with explicit cross-attention layers and a two-stage training framework for enhanced word alignment performance.

Findings

01

Achieves state-of-the-art results on four out of five language pairs.

02

Effectively models deep cross-lingual interactions for better alignment.

03

Two-stage training improves model robustness and accuracy.

Abstract

Word alignment which aims to extract lexicon translation equivalents between source and target sentences, serves as a fundamental tool for natural language processing. Recent studies in this area have yielded substantial improvements by generating alignments from contextualized embeddings of the pre-trained multilingual language models. However, we find that the existing approaches capture few interactions between the input sentence pairs, which degrades the word alignment quality severely, especially for the ambiguous words in the monolingual context. To remedy this problem, we propose Cross-Align to model deep interactions between the input sentence pairs, in which the source and target sentences are encoded separately with the shared self-attention modules in the shallow layers, while cross-lingual interactions are explicitly constructed by the cross-attention modules in the upper…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lisasiyu/cross-align
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification