Deciphering Implicit Hate: Evaluating Automated Detection Algorithms for Multimodal Hate
Austin Botelho, Bertie Vidgen, Scott A. Hale

TL;DR
This paper evaluates the effectiveness of semantic and multimodal context-aware models in detecting both implicit and explicit online hate, demonstrating that multimodal models outperform unimodal ones, especially in challenging cases with annotator disagreement.
Contribution
It introduces a high-quality multimodal hate speech dataset and compares the performance of unimodal and multimodal models, highlighting the advantages of multimodal approaches in hate detection.
Findings
Multimodal models achieve higher F1 scores (0.771) than unimodal models.
Models perform better on content with full annotator agreement.
Multimodal models excel at classifying content with annotator disagreement.
Abstract
Accurate detection and classification of online hate is a difficult task. Implicit hate is particularly challenging as such content tends to have unusual syntax, polysemic words, and fewer markers of prejudice (e.g., slurs). This problem is heightened with multimodal content, such as memes (combinations of text and images), as they are often harder to decipher than unimodal content (e.g., text alone). This paper evaluates the role of semantic and multimodal context for detecting implicit and explicit hate. We show that both text- and visual- enrichment improves model performance, with the multimodal model (0.771) outperforming other models' F1 scores (0.544, 0.737, and 0.754). While the unimodal-text context-aware (transformer) model was the most accurate on the subtask of implicit hate detection, the multimodal model outperformed it overall because of a lower propensity towards false…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection
