Deciphering Implicit Hate: Evaluating Automated Detection Algorithms for   Multimodal Hate

Austin Botelho; Bertie Vidgen; Scott A. Hale

arXiv:2106.05903·cs.CL·June 11, 2021

Deciphering Implicit Hate: Evaluating Automated Detection Algorithms for Multimodal Hate

Austin Botelho, Bertie Vidgen, Scott A. Hale

PDF

Open Access

TL;DR

This paper evaluates the effectiveness of semantic and multimodal context-aware models in detecting both implicit and explicit online hate, demonstrating that multimodal models outperform unimodal ones, especially in challenging cases with annotator disagreement.

Contribution

It introduces a high-quality multimodal hate speech dataset and compares the performance of unimodal and multimodal models, highlighting the advantages of multimodal approaches in hate detection.

Findings

01

Multimodal models achieve higher F1 scores (0.771) than unimodal models.

02

Models perform better on content with full annotator agreement.

03

Multimodal models excel at classifying content with annotator disagreement.

Abstract

Accurate detection and classification of online hate is a difficult task. Implicit hate is particularly challenging as such content tends to have unusual syntax, polysemic words, and fewer markers of prejudice (e.g., slurs). This problem is heightened with multimodal content, such as memes (combinations of text and images), as they are often harder to decipher than unimodal content (e.g., text alone). This paper evaluates the role of semantic and multimodal context for detecting implicit and explicit hate. We show that both text- and visual- enrichment improves model performance, with the multimodal model (0.771) outperforming other models' F1 scores (0.544, 0.737, and 0.754). While the unimodal-text context-aware (transformer) model was the most accurate on the subtask of implicit hate detection, the multimodal model outperformed it overall because of a lower propensity towards false…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection