Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification
Zhenpeng Chen, Sheng Shen, Ziniu Hu, Xuan Lu, Qiaozhu Mei, and Xuanzhe Liu

TL;DR
This paper introduces a novel emoji-based representation learning approach that enhances cross-lingual sentiment classification by capturing language-specific sentiment patterns, achieving state-of-the-art results especially with limited labeled data.
Contribution
It proposes using emojis as a new channel for learning sentiment-aware representations, improving cross-lingual transfer and handling low-resource scenarios.
Findings
Achieves state-of-the-art performance on benchmark datasets.
Effectively captures language-specific sentiment patterns.
Maintains high accuracy with scarce sentiment labels.
Abstract
Sentiment classification typically relies on a large amount of labeled data. In practice, the availability of labels is highly imbalanced among different languages, e.g., more English texts are labeled than texts in any other languages, which creates a considerable inequality in the quality of related information services received by users speaking different languages. To tackle this problem, cross-lingual sentiment classification approaches aim to transfer knowledge learned from one language that has abundant labeled examples (i.e., the source language, usually English) to another language with fewer labels (i.e., the target language). The source and the target languages are usually bridged through off-the-shelf machine translation tools. Through such a channel, cross-language sentiment patterns can be successfully learned from English and transferred into the target languages. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Natural Language Processing Techniques
