Benchmark dataset of memes with text transcriptions for automatic   detection of multi-modal misogynistic content

Francesca Gasparini; Giulia Rizzi; Aurora Saibene; Elisabetta Fersini

arXiv:2106.08409·cs.AI·October 7, 2022

Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content

Francesca Gasparini, Giulia Rizzi, Aurora Saibene, Elisabetta Fersini

PDF

Open Access 1 Repo

TL;DR

This paper introduces a benchmark dataset of 800 memes with text transcriptions, annotated for misogynistic content, to facilitate research on automatic detection of online misogyny using multimodal analysis.

Contribution

The paper provides a balanced, validated dataset of memes with expert and crowd labels, including textual transcriptions, for advancing multimodal misogyny detection research.

Findings

01

Dataset includes 800 memes with balanced misogynistic and non-misogynistic labels.

02

Annotations include misogyny, aggressiveness, and irony labels from experts and crowdsourcing.

03

Text transcriptions are manually provided for each meme.

Abstract

In this paper we present a benchmark dataset generated as part of a project for automatic identification of misogyny within online content, which focuses in particular on memes. The benchmark here described is composed of 800 memes collected from the most popular social media platforms, such as Facebook, Twitter, Instagram and Reddit, and consulting websites dedicated to collection and creation of memes. To gather misogynistic memes, specific keywords that refer to misogynistic content have been considered as search criterion, considering different manifestations of hatred against women, such as body shaming, stereotyping, objectification and violence. In parallel, memes with no misogynist content have been manually downloaded from the same web sources. Among all the collected memes, three domain experts have selected a dataset of 800 memes equally balanced between misogynistic and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MIND-Lab/MEME
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHumor Studies and Applications · Gender, Feminism, and Media · Hate Speech and Cyberbullying Detection