Beyond Trial-and-Error: Predicting User Abandonment After a Moderation Intervention
Benedetta Tessa, Lorenzo Cima, Amaury Trujillo, Marco Avvenuti, Stefano Cresci

TL;DR
This paper introduces a predictive machine learning approach to estimate user abandonment after moderation interventions on Reddit, enabling proactive moderation strategies to improve platform engagement and reduce negative outcomes.
Contribution
It presents the first model to predict user abandonment post-moderation, utilizing extensive features and demonstrating strong generalizability across communities.
Findings
Best model achieved micro F1-score of 0.914.
Activity features are the most informative predictors.
Model generalizes well to unseen communities.
Abstract
Current content moderation follows a reactive, trial-and-error approach, where interventions are applied and their effects are only measured post-hoc. In contrast, we introduce a proactive, predictive approach that enables moderators to anticipate the impact of their actions before implementation. We propose and tackle the new task of predicting user abandonment following a moderation intervention. We study the reactions of 16,540 users to a massive ban of online communities on Reddit, training a set of binary classifiers to identify those users who would abandon the platform after the intervention -- a problem of great practical relevance. We leverage a dataset of 13.8 million posts to compute a large and diverse set of 142 features, which convey information about the activity, toxicity, relations, and writing style of the users. We obtain promising results, with the best-performing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
