Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

TL;DR
This paper proposes a causality-guided disentanglement approach to improve cross-platform hate speech detection by learning invariant features that generalize better across different social media platforms.
Contribution
It introduces a novel causality-based disentanglement model that separates platform-dependent and independent features for robust hate speech detection across unseen platforms.
Findings
Outperforms state-of-the-art methods in cross-platform hate speech detection
Achieves better generalization across four social media platforms
Disentangles features to improve robustness against distribution shifts
Abstract
Social media platforms, despite their value in promoting open discourse, are often exploited to spread harmful content. Current deep learning and natural language processing models used for detecting this harmful content overly rely on domain-specific terms affecting their capabilities to adapt to generalizable hate speech detection. This is because they tend to focus too narrowly on particular linguistic signals or the use of certain categories of words. Another significant challenge arises when platforms lack high-quality annotated data for training, leading to a need for cross-platform models that can adapt to different distribution shifts. Our research introduces a cross-platform hate speech detection model capable of being trained on one platform's data and generalizing to multiple unseen platforms. To achieve good generalizability across platforms, one way is to disentangle the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsFocus
