Forecasting Suspicious Account Activity at Large-Scale Online Service Providers
Hassan Halawa, Matei Ripeanu, Konstantin Beznosov, Baris Coskun,, Meizhu Liu

TL;DR
This paper presents an automated machine learning-based early warning system that predicts potentially compromised accounts in large online services using login activity data, enabling faster detection and response to social engineering attacks.
Contribution
The paper introduces a scalable, real-world applicable machine learning system for predicting account compromise using minimal login data, demonstrated on large-scale production data.
Findings
System achieves up to one month early detection of suspicious accounts.
High precision and recall in classifying future compromised accounts.
Effective with low-cost features derived from login data.
Abstract
In the face of large-scale automated social engineering attacks to large online services, fast detection and remediation of compromised accounts are crucial to limit the spread of new attacks and to mitigate the overall damage to users, companies, and the public at large. We advocate a fully automated approach based on machine learning: we develop an early warning system that harnesses account activity traces to predict which accounts are likely to be compromised in the future and generate suspicious activity. We hypothesize that this early warning is key for a more timely detection of compromised accounts and consequently faster remediation. We demonstrate the feasibility and applicability of the system through an experiment at a large-scale online service provider using four months of real-world production data encompassing hundreds of millions of users. We show that - even using only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Information and Cyber Security · Spam and Phishing Detection
