An Autoencoder Approach to Learning Bilingual Word Representations

Sarath Chandar A P; Stanislas Lauly; Hugo Larochelle; Mitesh M.; Khapra; Balaraman Ravindran; Vikas Raykar; Amrita Saha

arXiv:1402.1454·cs.CL·February 7, 2014·252 cites

An Autoencoder Approach to Learning Bilingual Word Representations

Sarath Chandar A P, Stanislas Lauly, Hugo Larochelle, Mitesh M., Khapra, Balaraman Ravindran, Vikas Raykar, Amrita Saha

PDF

Open Access

TL;DR

This paper introduces an autoencoder-based method for learning bilingual word representations without requiring word-level alignments, improving cross-language classification performance significantly.

Contribution

It presents a novel autoencoder approach with correlation regularization for bilingual embedding learning that does not depend on word alignments.

Findings

01

Achieves 10-14 percentage point improvements in cross-language test classification.

02

Demonstrates high-quality bilingual word representations without word alignments.

03

Proposes and compares variations of autoencoders for this task.

Abstract

Cross-language learning allows us to use training data from one language to build models for a different language. Many approaches to bilingual learning require that we have word-level alignment of sentences from parallel corpora. In this work we explore the use of autoencoder-based methods for cross-language learning of vectorial word representations that are aligned between two languages, while not relying on word-level alignments. We show that by simply learning to reconstruct the bag-of-words representations of aligned sentences, within and between languages, we can in fact learn high-quality representations and do without word alignments. Since training autoencoders on word observations presents certain computational issues, we propose and compare different variations adapted to this setting. We also propose an explicit correlation maximizing regularizer that leads to significant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis