Predicting Hateful Discussions on Reddit using Graph Transformer Networks and Communal Context
Liam Hebert, Lukasz Golab, Robin Cohen

TL;DR
This paper introduces a novel approach combining Graph Transformer Networks and community-specific analysis to predict harmful Reddit discussions, significantly improving detection accuracy by leveraging conversation context and community nuances.
Contribution
It presents a new framework integrating Graph Transformer Networks with community-specific modeling for more effective hate speech detection in social media discussions.
Findings
Community-specific models double detection performance.
Wider discussion context improves accuracy by 28%.
Context-aware models perform better on highly hateful content.
Abstract
We propose a system to predict harmful discussions on social media platforms. Our solution uses contextual deep language models and proposes the novel idea of integrating state-of-the-art Graph Transformer Networks to analyze all conversations that follow an initial post. This framework also supports adapting to future comments as the conversation unfolds. In addition, we study whether a community-specific analysis of hate speech leads to more effective detection of hateful discussions. We evaluate our approach on 333,487 Reddit discussions from various communities. We find that community-specific modeling improves performance two-fold and that models which capture wider-discussion context improve accuracy by 28\% (35\% for the most hateful content) compared to limited context models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Social Media and Politics · Misinformation and Its Impacts
MethodsMulti-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Linear Layer · Dropout · Softmax · Laplacian EigenMap · Adam · Residual Connection
