Contrastive Learning of Emoji-based Representations for Resource-Poor Languages
Nurendra Choudhary, Rajat Singh, Ishita Bindlish, Manish, Shrivastava

TL;DR
This paper introduces CESNA, a contrastive learning model using Siamese Bi-LSTM networks to learn emoji-based representations for resource-poor languages by leveraging resource-rich languages, improving emoji prediction accuracy.
Contribution
The paper presents a novel Siamese network architecture that jointly trains resource-poor and resource-rich languages for emoji representation learning, outperforming existing methods.
Findings
CESNA outperforms state-of-the-art emoji prediction models.
The model effectively learns shared emoji-based representations across languages.
Experiments on Twitter datasets demonstrate improved accuracy in resource-poor languages.
Abstract
The introduction of emojis (or emoticons) in social media platforms has given the users an increased potential for expression. We propose a novel method called Classification of Emojis using Siamese Network Architecture (CESNA) to learn emoji-based representations of resource-poor languages by jointly training them with resource-rich languages using a siamese network. CESNA model consists of twin Bi-directional Long Short-Term Memory Recurrent Neural Networks (Bi-LSTM RNN) with shared parameters joined by a contrastive loss function based on a similarity metric. The model learns the representations of resource-poor and resource-rich language in a common emoji space by using a similarity metric based on the emojis present in sentences from both languages. The model, hence, projects sentences with similar emojis closer to each other and the sentences with different emojis farther from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining
MethodsSiamese Network
