Leveraging Multilingual Transformers for Hate Speech Detection

Sayar Ghosh Roy; Ujwal Narayan; Tathagata Raha; Zubair Abid; Vasudeva; Varma

arXiv:2101.03207·cs.CL·January 12, 2021·26 cites

Leveraging Multilingual Transformers for Hate Speech Detection

Sayar Ghosh Roy, Ujwal Narayan, Tathagata Raha, Zubair Abid, Vasudeva, Varma

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates how multilingual Transformer models can effectively detect and classify hate speech, offensive, and profane content across multiple languages, achieving high accuracy on social media data.

Contribution

It introduces a multilingual Transformer-based approach for hate speech detection and classification, incorporating API features and feature selection analysis.

Findings

01

Achieved macro F1 scores of 90.29, 81.87, 75.40 for hate speech detection in English, German, Hindi.

02

Achieved macro F1 scores of 60.70, 53.28, 49.74 for fine-grained classification.

03

Showed the effectiveness of Perspective API features and multilingual training schemes.

Abstract

Detecting and classifying instances of hate in social media text has been a problem of interest in Natural Language Processing in the recent years. Our work leverages state of the art Transformer language models to identify hate speech in a multilingual setting. Capturing the intent of a post or a comment on social media involves careful evaluation of the language style, semantic content and additional pointers such as hashtags and emojis. In this paper, we look at the problem of identifying whether a Twitter post is hateful and offensive or not. We further discriminate the detected toxic content into one of the following three classes: (a) Hate Speech (HATE), (b) Offensive (OFFN) and (c) Profane (PRFN). With a pre-trained multilingual Transformer-based text encoder at the base, we are able to successfully identify and classify hate speech from multiple languages. On the provided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sayarghoshroy/Hate-Speech-Detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Topic Modeling · Natural Language Processing Techniques

MethodsLinear Layer · Feature Selection · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Byte Pair Encoding · Multi-Head Attention · Attention Is All You Need · Dropout