Learning Bilingual Word Representations by Marginalizing Alignments

Tom\'a\v{s} Ko\v{c}isk\'y; Karl Moritz Hermann; Phil Blunsom

arXiv:1405.0947·cs.CL·May 6, 2014·42 cites

Learning Bilingual Word Representations by Marginalizing Alignments

Tom\'a\v{s} Ko\v{c}isk\'y, Karl Moritz Hermann, Phil Blunsom

PDF

Open Access

TL;DR

This paper introduces a probabilistic model that learns bilingual word representations by marginalizing over alignments, capturing broader semantic context and improving cross-lingual classification performance.

Contribution

It proposes a novel probabilistic approach that marginalizes alignments to learn richer bilingual word embeddings, outperforming previous methods.

Findings

01

Outperforms previous state-of-the-art in cross-lingual classification

02

Captures larger semantic context than hard alignment models

03

Demonstrates effectiveness of marginalized alignment approach

Abstract

We present a probabilistic model that simultaneously learns alignments and distributed representations for bilingual data. By marginalizing over word alignments the model captures a larger semantic context than prior work relying on hard alignments. The advantage of this approach is demonstrated in a cross-lingual classification task, where we outperform the prior published state of the art.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis