Why So Inflammatory? Explainability in Automatic Detection of Inflammatory Social Media Users
Cuong Nguyen, Daniel Nkemelu, Ankit Mehta, Michael Best

TL;DR
This paper investigates how interaction features in machine learning models can explain the detection of inflammatory social media users, emphasizing the importance of explainability in low-resource, high-risk settings like the Global South.
Contribution
It demonstrates the significance of interaction features in model explainability and extends understanding of inflammatory content detection in low-resource, high-risk social media contexts.
Findings
Interaction features like account age and activity count have higher explanatory power.
Features such as name length and bio location are less significant.
Explainability tools reveal key features influencing model decisions.
Abstract
Hate speech and misinformation, spread over social networking services (SNS) such as Facebook and Twitter, have inflamed ethnic and political violence in countries across the globe. We argue that there is limited research on this problem within the context of the Global South and present an approach for tackling them. Prior works have shown how machine learning models built with user-level interaction features can effectively identify users who spread inflammatory content. While this technique is beneficial in low-resource language settings where linguistic resources such as ground truth data and processing capabilities are lacking, it is still unclear how these interaction features contribute to model performance. In this work, we investigate and show significant differences in interaction features between users who spread inflammatory content and others who do not, applying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts
