An Unsupervised Approach to Detect Spam Campaigns that Use Botnets on Twitter
Zhouhan Chen, Devika Subramanian

TL;DR
This paper presents an unsupervised, real-time detection system for Twitter spam campaigns involving botnets, revealing that a significant portion of tweets originate from malicious bots connected to large-scale spam operations.
Contribution
It introduces a novel unsupervised detection approach that identifies bot groups based on duplicate content and URL shortening, and provides an accessible database of detected malicious bots.
Findings
Bots account for 10% to 50% of tweets from URL shortening services.
Large-scale spam campaigns control thousands of domains.
Detection system operates continuously and provides a public API.
Abstract
In recent years, Twitter has seen a proliferation of automated accounts or bots that send spam, offer clickbait, compromise security using malware, and attempt to skew public opinion. Previous research estimates that around 9% to 17% of Twitter accounts are bots contributing to between 16% to 56% of tweets on the medium. This paper introduces an unsupervised approach to detect Twitter spam campaigns in real-time. The bot groups we detect tweet duplicate content with shortened embedded URLs over extended periods of time. Our experiments with the detection protocol reveal that bots consistently account for 10% to 50% of tweets generated from 7 popular URL shortening services on Twitter. More importantly, we discover that bots using shortened URLs are connected to large scale spam campaigns that control thousands of domains. There appear to be two distinct mechanisms used to control bot…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Network Security and Intrusion Detection · Advanced Malware Detection Techniques
