Understanding Textual Emotion Through Emoji Prediction

Ethan Gordon; Nishank Kuppa; Rigved Tummala; Sriram Anasuri

arXiv:2508.10222·cs.CL·August 15, 2025

TL;DR

This study evaluates four deep learning models (a feed-forward network, CNN, transformer, and BERT) for emoji prediction from short texts in the TweetEval dataset. BERT performs best overall, while the CNN is most effective on rare emojis, which underscores the role of architecture choice and hyperparameter tuning in sentiment-aware prediction.

Contribution

The paper compares four deep learning architectures for emoji prediction and demonstrates how model selection and hyperparameter tuning affect performance.

Findings

01. BERT achieves the highest overall accuracy.

02. CNN performs best on rare emoji classes.

03. Model choice significantly affects sentiment-aware emoji prediction.

Abstract

This project explores emoji prediction from short text sequences using four deep learning architectures: a feed-forward network, CNN, transformer, and BERT. Using the TweetEval dataset, we address class imbalance through focal loss and regularization techniques. Results show BERT achieves the highest overall performance due to its pre-training advantage, while CNN demonstrates superior efficacy on rare emoji classes. This research shows the importance of architecture selection and hyperparameter tuning for sentiment-aware emoji prediction, contributing to improved human-computer interaction.
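
The abstract names focal loss as the remedy for class imbalance but does not give the formulation used. As a rough illustration only, the standard multi-class form is FL(p_t) = -alpha_t * (1 - p_t)^gamma * log(p_t), where p_t is the softmax probability of the true class. The sketch below is plain Python for readability; the function names, the default gamma of 2.0, and the optional per-class alpha weights are assumptions for illustration, not the authors' settings or code.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, target, gamma=2.0, alpha=None):
    """Focal loss for one example.

    logits: raw scores, one per emoji class.
    target: index of the true class.
    gamma:  focusing parameter (assumed value; 0.0 recovers cross-entropy).
    alpha:  optional list of per-class weights (hypothetical, e.g. larger
            for rare emojis); None means every class has weight 1.
    """
    p_t = softmax(logits)[target]
    weight = 1.0 if alpha is None else alpha[target]
    # (1 - p_t) ** gamma shrinks the loss of confidently correct examples,
    # so hard and rare-class examples contribute relatively more.
    return -weight * (1.0 - p_t) ** gamma * math.log(p_t)


def cross_entropy(logits, target):
    """Plain cross-entropy, expressed as focal loss with gamma = 0."""
    return focal_loss(logits, target, gamma=0.0)


if __name__ == "__main__":
    easy = ([4.0, 0.0, 0.0], 0)  # true class already scored highest
    hard = ([0.0, 0.0, 2.0], 0)  # true class scored below another class
    for name, (logits, target) in (("easy", easy), ("hard", hard)):
        ce = cross_entropy(logits, target)
        fl = focal_loss(logits, target)
        print(f"{name}: cross-entropy={ce:.4f} focal={fl:.6f}")
```

Comparing the two printed rows shows the intended effect: the easy example's loss is scaled down far more than the hard example's, which is why focal loss is a common choice when a few frequent classes dominate the training set.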
