TL;DR
This paper presents a data-driven approach to identify and predict online ban evasion by analyzing behavioral similarities between banned users and their evading accounts, improving moderation efficiency.
Contribution
It introduces the first dataset and methods for detecting ban evasion, revealing behavioral patterns and training classifiers with high accuracy for early detection.
Findings
Behavioral similarities between parent and child accounts in usernames, pages, and content.
High accuracy in predicting future evaders (AUC = 0.78) and early detection (AUC = 0.85).
Effective matching of evading accounts to their banned counterparts (MRR = 0.97).
Abstract
Moderators and automated methods enforce bans on malicious users who engage in disruptive behavior. However, malicious users can easily create a new account to evade such bans. Previous research has focused on other forms of online deception, like the simultaneous operation of multiple accounts by the same entities (sockpuppetry), impersonation of other individuals, and studying the effects of de-platforming individuals and communities. Here we conduct the first data-driven study of ban evasion, i.e., the act of circumventing bans on an online platform, leading to temporally disjoint operation of accounts by the same user. We curate a novel dataset of 8,551 ban evasion pairs (parent, child) identified on Wikipedia and contrast their behavior with benign users and non-evading malicious users. We find that evasion child accounts demonstrate similarities with respect to their banned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLogistic Regression
