Real Time Prediction of Drive by Download Attacks on Twitter
Amir Javed, Pete Burnap, Omer Rana

TL;DR
This paper presents a machine learning approach to predict malicious URLs in tweets in real-time, enabling proactive blocking of drive-by-download attacks before they execute.
Contribution
The authors develop a machine learning model that predicts malicious URLs within one second of interaction, achieving high accuracy and enabling preemptive attack prevention.
Findings
99.2% F-measure with cross-validation
83.98% accuracy on unseen data
Real-time prediction within 1 second
Abstract
The popularity of Twitter for information discovery, coupled with the automatic shortening of URLs to save space, given the 140 character limit, provides cyber criminals with an opportunity to obfuscate the URL of a malicious Web page within a tweet. Once the URL is obfuscated the cyber criminal can lure a user to click on it with enticing text and images before carrying out a cyber attack using a malicious Web server. This is known as a drive-by- download. In a drive-by-download a user's computer system is infected while interacting with the malicious endpoint, often without them being made aware, the attack has taken place. An attacker can gain control of the system by exploiting unpatched system vulnerabilities and this form of attack currently represents one of the most common methods employed. In this paper, we build a machine learning model using machine activity data and tweet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Network Security and Intrusion Detection · Advanced Malware Detection Techniques
