Large-Scale Hate Speech Detection with Cross-Domain Transfer

Cagri Toraman; Furkan \c{S}ahinu\c{c}; Eyup Halit Yilmaz

arXiv:2203.01111·cs.CL·July 7, 2022·29 cites

Large-Scale Hate Speech Detection with Cross-Domain Transfer

Cagri Toraman, Furkan \c{S}ahinu\c{c}, Eyup Halit Yilmaz

PDF

Open Access 1 Repo

TL;DR

This paper introduces large-scale, multi-domain hate speech datasets in English and Turkish, demonstrating that Transformer models outperform traditional methods and generalize well across hate domains, with notable transfer learning capabilities.

Contribution

It creates extensive multilingual, multi-domain hate speech datasets and evaluates the effectiveness of Transformer models and cross-domain transfer learning in large-scale hate speech detection.

Findings

01

Transformer models outperform traditional models by at least 5% in English and 10% in Turkish.

02

High scalability with 98% and 97% performance recovery using only 20% of training data.

03

Cross-domain transfer recovers over 90% of performance, with gender and religion domains generalizing better.

Abstract

The performance of hate speech detection models relies on the datasets on which the models are trained. Existing datasets are mostly prepared with a limited number of instances or hate domains that define hate topics. This hinders large-scale analysis and transfer learning with respect to hate domains. In this study, we construct large-scale tweet datasets for hate speech detection in English and a low-resource language, Turkish, consisting of human-labeled 100k tweets per each. Our datasets are designed to have equal number of tweets distributed over five domains. The experimental results supported by statistical tests show that Transformer-based language models outperform conventional bag-of-words and neural models by at least 5% in English and 10% in Turkish for large-scale hate speech detection. The performance is also scalable to different training sizes, such that 98% of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

avaapm/hatespeech
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Internet Traffic Analysis and Secure E-voting