# A Survey Of Cross-lingual Word Embedding Models

**Authors:** Sebastian Ruder, Ivan Vuli\'c, Anders S{\o}gaard

arXiv: 1706.04902 · 2019-10-08

## TL;DR

This survey comprehensively reviews cross-lingual word embedding models, analyzing their types, data needs, objectives, evaluation methods, and future research directions in multilingual NLP.

## Contribution

It provides a detailed typology and comparison of cross-lingual word embedding models, highlighting their similarities and differences, and discusses evaluation and future challenges.

## Key findings

- Many models optimize similar objectives
- Models are often equivalent under different strategies
- Evaluation methods vary across models

## Abstract

Cross-lingual representations of words enable us to reason about word meaning in multilingual contexts and are a key facilitator of cross-lingual transfer when developing natural language processing models for low-resource languages. In this survey, we provide a comprehensive typology of cross-lingual word embedding models. We compare their data requirements and objective functions. The recurring theme of the survey is that many of the models presented in the literature optimize for the same objectives, and that seemingly different models are often equivalent modulo optimization strategies, hyper-parameters, and such. We also discuss the different ways cross-lingual word embeddings are evaluated, as well as future challenges and research horizons.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.04902/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1706.04902/full.md

## References

216 references — full list in the complete paper: https://tomesphere.com/paper/1706.04902/full.md

---
Source: https://tomesphere.com/paper/1706.04902