Concatenated Power Mean Word Embeddings as Universal Cross-Lingual   Sentence Representations

Andreas R\"uckl\'e; Steffen Eger; Maxime Peyrard; Iryna Gurevych

arXiv:1803.01400·cs.CL·September 13, 2018·74 cites

Concatenated Power Mean Word Embeddings as Universal Cross-Lingual Sentence Representations

Andreas R\"uckl\'e, Steffen Eger, Maxime Peyrard, Iryna Gurevych

PDF

Open Access 1 Repo

TL;DR

This paper introduces concatenated power mean word embeddings, a simple yet effective method that improves sentence representations both monolingually and cross-lingually, outperforming complex models and recent baselines.

Contribution

It generalizes average embeddings to power means and demonstrates their effectiveness when concatenated, especially in cross-lingual tasks.

Findings

01

Outperforms state-of-the-art monolingual sentence embeddings.

02

Significantly outperforms recent baselines like SIF and Sent2Vec.

03

Enhances cross-lingual sentence representation quality.

Abstract

Average word embeddings are a common baseline for more sophisticated sentence embedding techniques. However, they typically fall short of the performances of more complex models such as InferSent. Here, we generalize the concept of average word embeddings to power mean word embeddings. We show that the concatenation of different types of power mean word embeddings considerably closes the gap to state-of-the-art methods monolingually and substantially outperforms these more complex techniques cross-lingually. In addition, our proposed method outperforms different recently proposed baselines such as SIF and Sent2Vec by a solid margin, thus constituting a much harder-to-beat monolingual baseline. Our data and code are publicly available.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UKPLab/arxiv2018-xling-sentence-embeddings
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining