Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content
Francesca Gasparini, Giulia Rizzi, Aurora Saibene, Elisabetta Fersini

TL;DR
This paper introduces a benchmark dataset of 800 memes with text transcriptions, annotated for misogynistic content, to facilitate research on automatic detection of online misogyny using multimodal analysis.
Contribution
The paper provides a balanced, validated dataset of memes with expert and crowd labels, including textual transcriptions, for advancing multimodal misogyny detection research.
Findings
Dataset includes 800 memes with balanced misogynistic and non-misogynistic labels.
Annotations include misogyny, aggressiveness, and irony labels from experts and crowdsourcing.
Text transcriptions are manually provided for each meme.
Abstract
In this paper we present a benchmark dataset generated as part of a project for automatic identification of misogyny within online content, which focuses in particular on memes. The benchmark here described is composed of 800 memes collected from the most popular social media platforms, such as Facebook, Twitter, Instagram and Reddit, and consulting websites dedicated to collection and creation of memes. To gather misogynistic memes, specific keywords that refer to misogynistic content have been considered as search criterion, considering different manifestations of hatred against women, such as body shaming, stereotyping, objectification and violence. In parallel, memes with no misogynist content have been manually downloaded from the same web sources. Among all the collected memes, three domain experts have selected a dataset of 800 memes equally balanced between misogynistic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHumor Studies and Applications · Gender, Feminism, and Media · Hate Speech and Cyberbullying Detection
