Detection of Hate Speech using BERT and Hate Speech Word Embedding with Deep Model
Hind Saleh, Areej Alhothali, Kawthar Moria

TL;DR
This paper explores the use of domain-specific word embeddings and BERT in deep learning models to improve the automatic detection and classification of online hate speech, achieving high accuracy.
Contribution
It introduces the application of domain-specific word embeddings with BiLSTM and leverages BERT for hate speech detection, demonstrating their effectiveness.
Findings
Domain-specific embeddings with BiLSTM achieved 93% F1-score.
BERT achieved up to 96% F1-score.
Models effectively detect hate speech despite obfuscation tactics.
Abstract
The enormous amount of data being generated on the web and social media has increased the demand for detecting online hate speech. Detecting hate speech will reduce their negative impact and influence on others. A lot of effort in the Natural Language Processing (NLP) domain aimed to detect hate speech in general or detect specific hate speech such as religion, race, gender, or sexual orientation. Hate communities tend to use abbreviations, intentional spelling mistakes, and coded words in their communication to evade detection, adding more challenges to hate speech detection tasks. Thus, word representation will play an increasingly pivotal role in detecting hate speech. This paper investigates the feasibility of leveraging domain-specific word embedding in Bidirectional LSTM based deep model to automatically detect/classify hate speech. Furthermore, we investigate the use of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Residual Connection · Layer Normalization · Softmax · Weight Decay · WordPiece
