A Multimodal Framework for the Detection of Hateful Memes
Phillip Lippe, Nithin Holla, Shantanu Chandra, Santhosh Rajamanickam,, Georgios Antoniou, Ekaterina Shutova, Helen Yannakoudakis

TL;DR
This paper introduces a multimodal framework for detecting hateful memes, leveraging ensemble learning and contrastive example upsampling to improve accuracy in identifying online hate speech.
Contribution
It advances multimodal hate speech detection by enhancing existing models with ensemble techniques and data augmentation strategies, achieving state-of-the-art performance.
Findings
Achieved an AUROC score of 80.53 on the Hateful Memes Challenge
Ensemble of UNITER-based models improves detection robustness
Upsampling contrastive examples enhances multimodal reasoning
Abstract
An increasingly common expression of online hate speech is multimodal in nature and comes in the form of memes. Designing systems to automatically detect hateful content is of paramount importance if we are to mitigate its undesirable effects on the society at large. The detection of multimodal hate speech is an intrinsically difficult and open problem: memes convey a message using both images and text and, hence, require multimodal reasoning and joint visual and language understanding. In this work, we seek to advance this line of research and develop a multimodal framework for the detection of hateful memes. We improve the performance of existing multimodal approaches beyond simple fine-tuning and, among others, show the effectiveness of upsampling of contrastive examples to encourage multimodality and ensemble learning based on cross-validation to improve robustness. We furthermore…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Humor Studies and Applications · Sentiment Analysis and Opinion Mining
