Why is unsupervised alignment of English embeddings from different   algorithms so hard?

Mareike Hartmann; Yova Kementchedjhieva; Anders S{\o}gaard

arXiv:1809.00150·cs.CL·September 5, 2018

Why is unsupervised alignment of English embeddings from different algorithms so hard?

Mareike Hartmann, Yova Kementchedjhieva, Anders S{\o}gaard

PDF

Open Access

TL;DR

This paper investigates why unsupervised alignment of English word embeddings from different algorithms is difficult, revealing that algorithm biases create complex optimization landscapes hindering GAN-based alignment.

Contribution

It highlights the role of inductive biases in embedding algorithms in causing alignment difficulties and demonstrates the existence of linear transforms between embeddings.

Findings

01

GANs can align embeddings from the same algorithm but fail across different algorithms.

02

Differences in embedding algorithms lead to complex optimization landscapes with local optima.

03

Linear transforms exist between embeddings from different algorithms, but alignment remains challenging.

Abstract

This paper presents a challenge to the community: Generative adversarial networks (GANs) can perfectly align independent English word embeddings induced using the same algorithm, based on distributional information alone; but fails to do so, for two different embeddings algorithms. Why is that? We believe understanding why, is key to understand both modern word embedding algorithms and the limitations and instability dynamics of GANs. This paper shows that (a) in all these cases, where alignment fails, there exists a linear transform between the two embeddings (so algorithm biases do not lead to non-linear differences), and (b) similar effects can not easily be obtained by varying hyper-parameters. One plausible suggestion based on our initial experiments is that the differences in the inductive biases of the embedding algorithms lead to an optimization landscape that is riddled with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques