Are Girls Neko or Sh\=ojo? Cross-Lingual Alignment of Non-Isomorphic   Embeddings with Iterative Normalization

Mozhi Zhang; Keyulu Xu; Ken-ichi Kawarabayashi; Stefanie Jegelka,; Jordan Boyd-Graber

arXiv:1906.01622·cs.CL·November 12, 2019·5 cites

Are Girls Neko or Sh\=ojo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization

Mozhi Zhang, Keyulu Xu, Ken-ichi Kawarabayashi, Stefanie Jegelka,, Jordan Boyd-Graber

PDF

Open Access 1 Repo

TL;DR

This paper introduces Iterative Normalization, a method to improve cross-lingual word embeddings for non-isomorphic language pairs by normalizing embeddings, significantly enhancing translation accuracy especially for challenging pairs like English-Japanese.

Contribution

The paper proposes a novel normalization technique that facilitates better orthogonal alignment of non-isomorphic embeddings, improving multilingual NLP performance.

Findings

01

Significant accuracy improvements on English-Japanese translation

02

Consistent enhancement across three CLWE methods

03

Largest gain from 2% to 44% accuracy on Japanese-English pair

Abstract

Cross-lingual word embeddings (CLWE) underlie many multilingual natural language processing systems, often through orthogonal transformations of pre-trained monolingual embeddings. However, orthogonal mapping only works on language pairs whose embeddings are naturally isomorphic. For non-isomorphic pairs, our method (Iterative Normalization) transforms monolingual embeddings to make orthogonal alignment easier by simultaneously enforcing that (1) individual word vectors are unit length, and (2) each language's average vector is zero. Iterative Normalization consistently improves word translation accuracy of three CLWE methods, with the largest improvement observed on English-Japanese (from 2% to 44% test accuracy).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhangmozhi/iternorm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification