The New Modality: Emoji Challenges in Prediction, Anticipation, and   Retrieval

Spencer Cappallo; Stacey Svetlichnaya; Pierre Garrigues; Thomas; Mensink; Cees G. M. Snoek

arXiv:1801.10253·cs.CL·February 5, 2018

The New Modality: Emoji Challenges in Prediction, Anticipation, and Retrieval

Spencer Cappallo, Stacey Svetlichnaya, Pierre Garrigues, Thomas, Mensink, Cees G. M. Snoek

PDF

TL;DR

This paper introduces emoji as a distinct modality in multimedia research, presenting a large dataset, baseline prediction models, and retrieval methods to understand and utilize emoji in relation to text and images.

Contribution

It is the first to treat emoji as a separate modality, providing a large dataset, baseline models, and initial approaches for prediction and retrieval tasks.

Findings

01

Baseline models achieve promising emoji prediction accuracy.

02

A large Twitter dataset of emoji usage is introduced.

03

Initial results show potential for emoji-based multimedia retrieval.

Abstract

Over the past decade, emoji have emerged as a new and widespread form of digital communication, spanning diverse social networks and spoken languages. We propose to treat these ideograms as a new modality in their own right, distinct in their semantic structure from both the text in which they are often embedded as well as the images which they resemble. As a new modality, emoji present rich novel possibilities for representation and interaction. In this paper, we explore the challenges that arise naturally from considering the emoji modality through the lens of multimedia research. Specifically, the ways in which emoji can be related to other common modalities such as text and images. To do so, we first present a large scale dataset of real-world emoji usage collected from Twitter. This dataset contains examples of both text-emoji and image-emoji relationships. We present baseline…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.