Predictive Response Optimization: Using Reinforcement Learning to Fight Online Social Network Abuse
Garrett Wilson, Geoffrey Goh, Yan Jiang, Ajay Gupta, Jiaxuan Wang,, David Freeman, Francesco Dinuzzo

TL;DR
This paper introduces Predictive Response Optimization (PRO), a reinforcement learning system that optimizes actions to combat social network abuse by balancing harm reduction and user experience, outperforming traditional classifiers.
Contribution
It proposes a novel reinforcement learning framework for selecting actions post-abuse detection, enhancing tradeoff optimization beyond binary classification.
Findings
PRO reduces abuse volume by 59% on Instagram
PRO reduces abuse volume by 4.5% on Facebook
PRO adapts quickly to changes in tactics and constraints
Abstract
Detecting phishing, spam, fake accounts, data scraping, and other malicious activity in online social networks (OSNs) is a problem that has been studied for well over a decade, with a number of important results. Nearly all existing works on abuse detection have as their goal producing the best possible binary classifier; i.e., one that labels unseen examples as "benign" or "malicious" with high precision and recall. However, no prior published work considers what comes next: what does the service actually do after it detects abuse? In this paper, we argue that detection as described in previous work is not the goal of those who are fighting OSN abuse. Rather, we believe the goal to be selecting actions (e.g., ban the user, block the request, show a CAPTCHA, or "collect more evidence") that optimize a tradeoff between harm caused by abuse and impact on benign users. With this framing,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
Methodstravel james · Sparse Evolutionary Training
