Misogynistic Meme Detection using Early Fusion Model with Graph Network
Harshvardhan Srivastava

TL;DR
This paper presents an early fusion model combining text and image data with graph networks to detect misogynistic memes, achieving competitive results in SemEval-2022.
Contribution
It introduces a novel multimodal fusion approach using pretrained transformers and graph networks for misogyny detection in memes.
Findings
Achieved competitive results in SemEval-2022 Task 5
Significantly outperformed baseline models
Effective multimodal fusion of text and image data
Abstract
In recent years , there has been an upsurge in a new form of entertainment medium called memes. These memes although seemingly innocuous have transcended onto the boundary of online harassment against women and created an unwanted bias against them . To help alleviate this problem , we propose an early fusion model for prediction and identification of misogynistic memes and its type in this paper for which we participated in SemEval-2022 Task 5 . The model receives as input meme image with its text transcription with a target vector. Given that a key challenge with this task is the combination of different modalities to predict misogyny, our model relies on pretrained contextual representations from different state-of-the-art transformer-based language models and pretrained image pretrained models to get an effective image representation. Our model achieved competitive results on both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Humor Studies and Applications · Digital Games and Media
