Contrastive Unsupervised Word Alignment with Non-Local Features
Yang Liu, Maosong Sun

TL;DR
This paper introduces a contrastive unsupervised method for word alignment that efficiently incorporates non-local features, leading to significant improvements over existing methods.
Contribution
It proposes a novel contrastive learning approach that simplifies expectation calculations for non-local features in unsupervised word alignment.
Findings
Achieves significant improvements over state-of-the-art methods
Uses top-n alignments to approximate expectations efficiently
Effectively incorporates non-local features in unsupervised learning
Abstract
Word alignment is an important natural language processing task that indicates the correspondence between natural languages. Recently, unsupervised learning of log-linear models for word alignment has received considerable attention as it combines the merits of generative and discriminative approaches. However, a major challenge still remains: it is intractable to calculate the expectations of non-local features that are critical for capturing the divergence between natural languages. We propose a contrastive approach that aims to differentiate observed training examples from noises. It not only introduces prior knowledge to guide unsupervised learning but also cancels out partition functions. Based on the observation that the probability mass of log-linear models for word alignment is usually highly concentrated, we propose to use top-n alignments to approximate the expectations with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
