# Multimodal Emotion Classification

**Authors:** Anurag Illendula, Amit Sheth

arXiv: 1903.12520 · 2019-04-01

## TL;DR

This paper explores how combining text, emojis, and images in social media posts improves emotion classification accuracy, demonstrating that multimodal data provides complementary information and that emoji sense depends on textual context.

## Contribution

It introduces a multimodal approach incorporating domain knowledge of emojis and demonstrates the benefit of combining text, emojis, and images for emotion classification.

## Key findings

- Multimodal data improves emotion classification accuracy.
- Emojis' meaning depends on textual context.
- Best accuracy of 71.98% with 550k posts.

## Abstract

Most NLP and Computer Vision tasks are limited to scarcity of labelled data. In social media emotion classification and other related tasks, hashtags have been used as indicators to label data. With the rapid increase in emoji usage of social media, emojis are used as an additional feature for major social NLP tasks. However, this is less explored in case of multimedia posts on social media where posts are composed of both image and text. At the same time, w.e have seen a surge in the interest to incorporate domain knowledge to improve machine understanding of text. In this paper, we investigate whether domain knowledge for emoji can improve the accuracy of emotion classification task. We exploit the importance of different modalities from social media post for emotion classification task using state-of-the-art deep learning architectures. Our experiments demonstrate that the three modalities (text, emoji and images) encode different information to express emotion and therefore can complement each other. Our results also demonstrate that emoji sense depends on the textual context, and emoji combined with text encodes better information than considered separately. The highest accuracy of 71.98\% is achieved with a training data of 550k posts.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.12520/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1903.12520/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/1903.12520/full.md

---
Source: https://tomesphere.com/paper/1903.12520