Finding Good Representations of Emotions for Text Classification
Ji Ho Park

TL;DR
This paper develops emotion-aware text representations at word and sentence levels, improving sentiment and emotion analysis while addressing gender bias in models through neural network training on large, weakly labeled datasets.
Contribution
It introduces emotion-specific word and sentence embeddings trained on large weakly labeled datasets, and explores bias reduction techniques for more robust affective text classification.
Findings
Emotion-aware embeddings improve sentiment analysis accuracy.
Large-scale training on hashtags and emojis enhances representation quality.
Bias reduction methods lead to more equitable models.
Abstract
It is important for machines to interpret human emotions properly for better human-machine communications, as emotion is an essential part of human-to-human communications. One aspect of emotion is reflected in the language we use. How to represent emotions in texts is a challenge in natural language processing (NLP). Although continuous vector representations like word2vec have become the new norm for NLP problems, their limitations are that they do not take emotions into consideration and can unintentionally contain bias toward certain identities like different genders. This thesis focuses on improving existing representations in both word and sentence levels by explicitly taking emotions inside text and model bias into account in their training process. Our improved representations can help to build more robust machine learning models for affect-related text classification like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining
