Contrastive Unsupervised Word Alignment with Non-Local Features

Yang Liu; Maosong Sun

arXiv:1410.2082·cs.CL·October 13, 2014

Contrastive Unsupervised Word Alignment with Non-Local Features

Yang Liu, Maosong Sun

PDF

Open Access

TL;DR

This paper introduces a contrastive unsupervised method for word alignment that efficiently incorporates non-local features, leading to significant improvements over existing methods.

Contribution

It proposes a novel contrastive learning approach that simplifies expectation calculations for non-local features in unsupervised word alignment.

Findings

01

Achieves significant improvements over state-of-the-art methods

02

Uses top-n alignments to approximate expectations efficiently

03

Effectively incorporates non-local features in unsupervised learning

Abstract

Word alignment is an important natural language processing task that indicates the correspondence between natural languages. Recently, unsupervised learning of log-linear models for word alignment has received considerable attention as it combines the merits of generative and discriminative approaches. However, a major challenge still remains: it is intractable to calculate the expectations of non-local features that are critical for capturing the divergence between natural languages. We propose a contrastive approach that aims to differentiate observed training examples from noises. It not only introduces prior knowledge to guide unsupervised learning but also cancels out partition functions. Based on the observation that the probability mass of log-linear models for word alignment is usually highly concentrated, we propose to use top-n alignments to approximate the expectations with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis