Detecting Hate Speech in Multi-modal Memes
Abhishek Das, Japsimar Singh Wahi, Siyao Li

TL;DR
This paper addresses hate speech detection in multi-modal memes by exploring visual features and sentiment analysis to improve classification accuracy, overcoming dataset confounders and baseline limitations.
Contribution
It introduces methods combining visual object detection, image captioning, and sentiment analysis to enhance hate speech detection in memes, surpassing prior unimodal and multi-modal baselines.
Findings
Visual features improve detection accuracy
Sentiment analysis enriches feature representation
Proposed methods outperform baseline models
Abstract
In the past few years, there has been a surge of interest in multi-modal problems, from image captioning to visual question answering and beyond. In this paper, we focus on hate speech detection in multi-modal memes wherein memes pose an interesting multi-modal fusion problem. We aim to solve the Facebook Meme Challenge \cite{kiela2020hateful} which aims to solve a binary classification problem of predicting whether a meme is hateful or not. A crucial characteristic of the challenge is that it includes "benign confounders" to counter the possibility of models exploiting unimodal priors. The challenge states that the state-of-the-art models perform poorly compared to humans. During the analysis of the dataset, we realized that majority of the data points which are originally hateful are turned into benign just be describing the image of the meme. Also, majority of the multi-modal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Multimodal Machine Learning Applications · Humor Studies and Applications
