Columbia MVSO Image Sentiment Dataset

Vaidehi Dalmia; Hongyi Liu; Shih-Fu Chang

arXiv:1611.04455·cs.MM·November 15, 2016·2 cites

Columbia MVSO Image Sentiment Dataset

Vaidehi Dalmia, Hongyi Liu, Shih-Fu Chang

PDF

Open Access

TL;DR

This paper introduces the Columbia MVSO Image Sentiment Dataset, a large multilingual image sentiment dataset with human annotations, designed to benchmark automatic sentiment prediction systems in images.

Contribution

It provides a new, annotated benchmark dataset for evaluating image sentiment analysis systems, focusing on multilingual and adjective-noun pair concepts.

Findings

01

Dataset includes 15,600 concepts in 12 languages.

02

Human judgments collected for 3,911 English ANPs.

03

Dataset aims to serve as a benchmark for sentiment prediction systems.

Abstract

The Multilingual Visual Sentiment Ontology (MVSO) consists of 15,600 concepts in 12 different languages that are strongly related to emotions and sentiments expressed in images. These concepts are defined in the form of Adjective-Noun Pair (ANP), which are crawled and discovered from online image forum Flickr. In this work, we used Amazon Mechanical Turk as a crowd-sourcing platform to collect human judgments on sentiments expressed in images that are uniformly sampled over 3,911 English ANPs extracted from a tag-restricted subset of MVSO. Our goal is to use the dataset as a benchmark for the evaluation of systems that automatically predict sentiments in images or ANPs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Sentiment Analysis and Opinion Mining · Image Retrieval and Classification Techniques