Detecting Online Hate Speech: Approaches Using Weak Supervision and Network Embedding Models
Michael Ridenhour, Arunkumar Bagavathi, Elaheh Raisi, Siddharth, Krishnan

TL;DR
This paper introduces a weak supervision deep learning approach combined with network embedding techniques to detect online hate speech and identify hateful users on social media platforms like Gab, outperforming baseline models.
Contribution
It presents a novel weak supervision model for scoring interaction-level hate and integrates multilayer network embeddings for improved hateful user prediction.
Findings
Model outperforms baseline in identifying indirect hate interactions.
Multilayer network embeddings improve hateful user prediction by up to 7%.
Analysis uncovers both direct and indirect hateful conversations.
Abstract
The ubiquity of social media has transformed online interactions among individuals. Despite positive effects, it has also allowed anti-social elements to unite in alternative social media environments (eg. Gab.com) like never before. Detecting such hateful speech using automated techniques can allow social media platforms to moderate their content and prevent nefarious activities like hate speech propagation. In this work, we propose a weak supervision deep learning model that - (i) quantitatively uncover hateful users and (ii) present a novel qualitative analysis to uncover indirect hateful conversations. This model scores content on the interaction level, rather than the post or user level, and allows for characterization of users who most frequently participate in hateful conversations. We evaluate our model on 19.2M posts and show that our weak supervision model outperforms the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
