Countering hate on social media: Large scale classification of hate and counter speech
Joshua Garland, Keyan Ghazi-Zahedi, Jean-Gabriel Young and, Laurent H\'ebert-Dufresne, Mirta Galesic

TL;DR
This paper develops a large-scale, high-accuracy classifier for hate and counter speech on social media, enabling analysis of their interactions and impact over millions of tweets to understand their role in stabilizing online discourse.
Contribution
It introduces a novel ensemble learning approach with paragraph embeddings for classifying hate and counter speech, validated on a large dataset with high accuracy and human agreement.
Findings
Classifier achieved macro F1 scores from 0.76 to 0.97
Automated detection aligns closely with human judgment
Analysis of 135,000 Twitter conversations reveals insights into hate and counter speech dynamics
Abstract
Hateful rhetoric is plaguing online discourse, fostering extreme societal movements and possibly giving rise to real-world violence. A potential solution to this growing global problem is citizen-generated counter speech where citizens actively engage in hate-filled conversations to attempt to restore civil non-polarized discourse. However, its actual effectiveness in curbing the spread of hatred is unknown and hard to quantify. One major obstacle to researching this question is a lack of large labeled data sets for training automated classifiers to identify counter speech. Here we made use of a unique situation in Germany where self-labeling groups engaged in organized online hate and counter speech. We used an ensemble learning algorithm which pairs a variety of paragraph embeddings with regularized logistic regression functions to classify both hate and counter speech in a corpus of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLogistic Regression
