Predictive Embeddings for Hate Speech Detection on Twitter

Rohan Kshirsagar; Tyus Cukuvac; Kathleen McKeown; Susan McGregor

arXiv:1809.10644·cs.CL·September 28, 2018

Predictive Embeddings for Hate Speech Detection on Twitter

Rohan Kshirsagar, Tyus Cukuvac, Kathleen McKeown, Susan McGregor

PDF

TL;DR

This paper introduces a neural network approach utilizing pre-trained embeddings and pooling techniques to detect hate speech on Twitter, achieving high accuracy with fewer parameters and less preprocessing.

Contribution

The study presents a novel neural network model that outperforms existing methods in hate speech detection using minimal features and computational resources.

Findings

01

Achieves state-of-the-art F1 scores on three datasets

02

Uses significantly fewer parameters than previous models

03

Requires minimal feature preprocessing

Abstract

We present a neural-network based approach to classifying online hate speech in general, as well as racist and sexist speech in particular. Using pre-trained word embeddings and max/mean pooling from simple, fully-connected transformations of these embeddings, we are able to predict the occurrence of hate speech on three commonly used publicly available datasets. Our models match or outperform state of the art F1 performance on all three datasets using significantly fewer parameters and minimal feature preprocessing compared to previous methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.