Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
Minh Duc Bui, Katharina von der Wense, Anne Lauscher

TL;DR
This paper introduces Multi3Hate, a multilingual, multicultural dataset of memes for hate speech detection, and evaluates vision-language models' ability to understand cultural nuances in hate speech across five languages.
Contribution
It creates the first multimodal, multilingual hate speech dataset with multicultural annotations and analyzes how cultural backgrounds influence model performance and annotation agreement.
Findings
Cultural background significantly impacts hate speech annotation.
Models align more with US annotations than others, regardless of meme language.
Annotator agreement varies greatly across cultures, with only 67-74% agreement.
Abstract
Warning: this paper contains content that may be offensive or upsetting Hate speech moderation on global platforms poses unique challenges due to the multimodal and multilingual nature of content, along with the varying cultural perceptions. How well do current vision-language models (VLMs) navigate these nuances? To investigate this, we create the first multimodal and multilingual parallel hate speech dataset, annotated by a multicultural set of annotators, called Multi3Hate. It contains 300 parallel meme samples across 5 languages: English, German, Spanish, Hindi, and Mandarin. We demonstrate that cultural background significantly affects multimodal hate speech annotation in our dataset. The average pairwise agreement among countries is just 74%, significantly lower than that of randomly selected annotator groups. Our qualitative analysis indicates that the lowest pairwise label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsALIGN · Sparse Evolutionary Training
