Multimodal Hate Speech Detection from Bengali Memes and Texts
Md. Rezaul Karim, Sumon Kanti Dey, Tanhim Islam, Md. Shajalal, and Bharathi Raja Chakravarthi

TL;DR
This paper introduces the first multimodal Bengali hate speech dataset and evaluates neural models combining text and images, showing that multimodal analysis improves detection accuracy.
Contribution
The paper creates a novel multimodal Bengali hate speech dataset and benchmarks state-of-the-art neural architectures for joint analysis of text and images.
Findings
XLM-RoBERTa + DenseNet-161 achieved the highest F1 score of 0.83.
Text modality is more effective than images alone for hate speech detection.
Multimodal fusion improves detection accuracy over unimodal approaches.
Abstract
Numerous machine learning (ML) and deep learning (DL)-based approaches have been proposed to utilize textual data from social media for anti-social behavior analysis like cyberbullying, fake news detection, and identification of hate speech mainly for highly-resourced languages such as English. However, despite having a lot of diversity and millions of native speakers, some languages like Bengali are under-resourced, which is due to a lack of computational resources for natural language processing (NLP). Similar to other languages, Bengali social media contents also include images along with texts (e.g., multimodal memes are posted by embedding short texts into images on Facebook). Therefore, only the textual data is not enough to judge them since images might give extra context to make a proper judgement. This paper is about hate speech detection from multimodal Bengali memes and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Adam · Multi-Head Attention · Residual Connection · Depthwise Convolution · Attention Dropout · Pointwise Convolution · Softmax
