Predictive Response Optimization: Using Reinforcement Learning to Fight   Online Social Network Abuse

Garrett Wilson; Geoffrey Goh; Yan Jiang; Ajay Gupta; Jiaxuan Wang,; David Freeman; Francesco Dinuzzo

arXiv:2502.17693·cs.LG·February 26, 2025

Predictive Response Optimization: Using Reinforcement Learning to Fight Online Social Network Abuse

Garrett Wilson, Geoffrey Goh, Yan Jiang, Ajay Gupta, Jiaxuan Wang,, David Freeman, Francesco Dinuzzo

PDF

Open Access

TL;DR

This paper introduces Predictive Response Optimization (PRO), a reinforcement learning system that optimizes actions to combat social network abuse by balancing harm reduction and user experience, outperforming traditional classifiers.

Contribution

It proposes a novel reinforcement learning framework for selecting actions post-abuse detection, enhancing tradeoff optimization beyond binary classification.

Findings

01

PRO reduces abuse volume by 59% on Instagram

02

PRO reduces abuse volume by 4.5% on Facebook

03

PRO adapts quickly to changes in tactics and constraints

Abstract

Detecting phishing, spam, fake accounts, data scraping, and other malicious activity in online social networks (OSNs) is a problem that has been studied for well over a decade, with a number of important results. Nearly all existing works on abuse detection have as their goal producing the best possible binary classifier; i.e., one that labels unseen examples as "benign" or "malicious" with high precision and recall. However, no prior published work considers what comes next: what does the service actually do after it detects abuse? In this paper, we argue that detection as described in previous work is not the goal of those who are fighting OSN abuse. Rather, we believe the goal to be selecting actions (e.g., ban the user, block the request, show a CAPTCHA, or "collect more evidence") that optimize a tradeoff between harm caused by abuse and impact on benign users. With this framing,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection

Methodstravel james · Sparse Evolutionary Training