Leveraging Multilingual Transformers for Hate Speech Detection
Sayar Ghosh Roy, Ujwal Narayan, Tathagata Raha, Zubair Abid, Vasudeva, Varma

TL;DR
This paper demonstrates how multilingual Transformer models can effectively detect and classify hate speech, offensive, and profane content across multiple languages, achieving high accuracy on social media data.
Contribution
It introduces a multilingual Transformer-based approach for hate speech detection and classification, incorporating API features and feature selection analysis.
Findings
Achieved macro F1 scores of 90.29, 81.87, 75.40 for hate speech detection in English, German, Hindi.
Achieved macro F1 scores of 60.70, 53.28, 49.74 for fine-grained classification.
Showed the effectiveness of Perspective API features and multilingual training schemes.
Abstract
Detecting and classifying instances of hate in social media text has been a problem of interest in Natural Language Processing in the recent years. Our work leverages state of the art Transformer language models to identify hate speech in a multilingual setting. Capturing the intent of a post or a comment on social media involves careful evaluation of the language style, semantic content and additional pointers such as hashtags and emojis. In this paper, we look at the problem of identifying whether a Twitter post is hateful and offensive or not. We further discriminate the detected toxic content into one of the following three classes: (a) Hate Speech (HATE), (b) Offensive (OFFN) and (c) Profane (PRFN). With a pre-trained multilingual Transformer-based text encoder at the base, we are able to successfully identify and classify hate speech from multiple languages. On the provided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Topic Modeling · Natural Language Processing Techniques
MethodsLinear Layer · Feature Selection · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Byte Pair Encoding · Multi-Head Attention · Attention Is All You Need · Dropout
