Detecting Hateful Memes Using a Multimodal Deep Ensemble

Vlad Sandulescu

arXiv:2012.13235·cs.LG·December 25, 2020·27 cites

Detecting Hateful Memes Using a Multimodal Deep Ensemble

Vlad Sandulescu

PDF

Open Access 1 Repo

TL;DR

This paper improves multimodal deep learning models for detecting hateful memes, achieving state-of-the-art performance and ranking fifth among over three thousand participants.

Contribution

It introduces enhancements to visual-linguistic Transformer architectures for hate speech detection, significantly boosting accuracy.

Findings

01

Model outperforms baseline methods by a large margin

02

Achieves 5th place on the leaderboard among 3,100+ teams

03

Demonstrates the effectiveness of proposed improvements

Abstract

While significant progress has been made using machine learning algorithms to detect hate speech, important technical challenges still remain to be solved in order to bring their performance closer to human accuracy. We investigate several of the most recent visual-linguistic Transformer architectures and propose improvements to increase their performance for this task. The proposed model outperforms the baselines by a large margin and ranks 5 $^{t h}$ on the leaderboard out of 3,100+ participants.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vladsandulescu/hatefulmemes
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Humor Studies and Applications · Sentiment Analysis and Opinion Mining

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Byte Pair Encoding · Label Smoothing · Adam · Dense Connections · Layer Normalization · Attention Is All You Need