Detection of Hate Speech using BERT and Hate Speech Word Embedding with   Deep Model

Hind Saleh; Areej Alhothali; Kawthar Moria

arXiv:2111.01515·cs.CL·November 3, 2021

Detection of Hate Speech using BERT and Hate Speech Word Embedding with Deep Model

Hind Saleh, Areej Alhothali, Kawthar Moria

PDF

Open Access

TL;DR

This paper explores the use of domain-specific word embeddings and BERT in deep learning models to improve the automatic detection and classification of online hate speech, achieving high accuracy.

Contribution

It introduces the application of domain-specific word embeddings with BiLSTM and leverages BERT for hate speech detection, demonstrating their effectiveness.

Findings

01

Domain-specific embeddings with BiLSTM achieved 93% F1-score.

02

BERT achieved up to 96% F1-score.

03

Models effectively detect hate speech despite obfuscation tactics.

Abstract

The enormous amount of data being generated on the web and social media has increased the demand for detecting online hate speech. Detecting hate speech will reduce their negative impact and influence on others. A lot of effort in the Natural Language Processing (NLP) domain aimed to detect hate speech in general or detect specific hate speech such as religion, race, gender, or sexual orientation. Hate communities tend to use abbreviations, intentional spelling mistakes, and coded words in their communication to evade detection, adding more challenges to hate speech detection tasks. Thus, word representation will play an increasingly pivotal role in detecting hate speech. This paper investigates the feasibility of leveraging domain-specific word embedding in Bidirectional LSTM based deep model to automatically detect/classify hate speech. Furthermore, we investigate the use of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Residual Connection · Layer Normalization · Softmax · Weight Decay · WordPiece