# Decoding Sentiment from Distributed Representations of Sentences

**Authors:** Edoardo Maria Ponti, Ivan Vuli\'c, Anna Korhonen

arXiv: 1705.06369 · 2017-09-07

## TL;DR

This paper investigates how well distributed sentence representations encode sentiment across diverse languages, revealing that no single architecture is best universally and that simple models can outperform complex ones depending on language-specific features.

## Contribution

It systematically compares different unsupervised sentence representation architectures for sentiment decoding across multiple languages, highlighting language-dependent performance and the effectiveness of additive models.

## Key findings

- No single architecture outperforms others across all languages.
- Additive models based on skip-gram vectors can surpass supervised models like bidirectional LSTMs.
- Performance varies depending on language-specific negative constructions.

## Abstract

Distributed representations of sentences have been developed recently to represent their meaning as real-valued vectors. However, it is not clear how much information such representations retain about the polarity of sentences. To study this question, we decode sentiment from unsupervised sentence representations learned with different architectures (sensitive to the order of words, the order of sentences, or none) in 9 typologically diverse languages. Sentiment results from the (recursive) composition of lexical items and grammatical strategies such as negation and concession. The results are manifold: we show that there is no `one-size-fits-all' representation architecture outperforming the others across the board. Rather, the top-ranking architectures depend on the language and data at hand. Moreover, we find that in several cases the additive composition model based on skip-gram word vectors may surpass supervised state-of-art architectures such as bidirectional LSTMs. Finally, we provide a possible explanation of the observed variation based on the type of negative constructions in each language.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.06369/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/1705.06369/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/1705.06369/full.md

---
Source: https://tomesphere.com/paper/1705.06369