Emotion Detection in Reddit: Comparative Study of Machine Learning and Deep Learning Techniques
Maliheh Alaeddini

TL;DR
This paper compares various machine learning and deep learning models for emotion detection in Reddit comments, finding that a Stacking classifier outperforms other models and EmoBERTa, with deployment in a web app.
Contribution
It introduces a comprehensive comparison of models for Reddit emotion detection and demonstrates the superior performance of a Stacking classifier over existing models.
Findings
Stacking classifier achieves highest accuracy among tested models.
Stacking classifier outperforms EmoBERTa in benchmark tests.
Web deployment showcases practical application potential.
Abstract
Emotion detection is pivotal in human communication, as it significantly influences behavior, relationships, and decision-making processes. This study concentrates on text-based emotion detection by leveraging the GoEmotions dataset, which annotates Reddit comments with 27 distinct emotions. These emotions are subsequently mapped to Ekman's six basic categories: joy, anger, fear, sadness, disgust, and surprise. We employed a range of models for this task, including six machine learning models, three ensemble models, and a Long Short-Term Memory (LSTM) model to determine the optimal model for emotion detection. Results indicate that the Stacking classifier outperforms other models in accuracy and performance. We also benchmark our models against EmoBERTa, a pre-trained emotion detection model, with our Stacking classifier proving more effective. Finally, the Stacking classifier is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Sentiment Analysis and Opinion Mining
