Deep Learning Paradigm with Transformed Monolingual Word Embeddings for   Multilingual Sentiment Analysis

Yujie Lu; Tatsunori Mori

arXiv:1710.03203·cs.CL·October 11, 2017·1 cites

Deep Learning Paradigm with Transformed Monolingual Word Embeddings for Multilingual Sentiment Analysis

Yujie Lu, Tatsunori Mori

PDF

Open Access

TL;DR

This paper introduces a novel deep learning framework for multilingual sentiment analysis that maps monolingual embeddings into a shared space, avoiding reliance on machine translation and improving accuracy.

Contribution

It proposes a new paradigm combining monolingual embedding mapping and parameter-sharing neural networks for effective multilingual sentiment analysis.

Findings

01

CNN model outperforms baseline by 2.1% accuracy

02

Shared embedding space improves cross-lingual sentiment classification

03

Method reduces dependency on machine translation quality

Abstract

The surge of social media use brings huge demand of multilingual sentiment analysis (MSA) for unveiling cultural difference. So far, traditional methods resorted to machine translation---translating texts in other languages to English, and then adopt the methods once worked in English. However, this paradigm is conditioned by the quality of machine translation. In this paper, we propose a new deep learning paradigm to assimilate the differences between languages for MSA. We first pre-train monolingual word embeddings separately, then map word embeddings in different spaces into a shared embedding space, and then finally train a parameter-sharing deep neural network for MSA. The experimental results show that our paradigm is effective. Especially, our CNN model outperforms a state-of-the-art baseline by around 2.1% in terms of classification accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Topic Modeling