Online learning for Social Spammer Detection on Twitter
Phuc Tri Nguyen, Hideaki Takeda

TL;DR
This paper explores online learning techniques for detecting social spammers on Twitter, addressing challenges of high data volume and evolving spammer strategies by enabling real-time model updates.
Contribution
It introduces an online learning framework for spammer detection on Twitter, demonstrating its efficiency over traditional batch methods and analyzing feature set effectiveness.
Findings
Online learning outperforms batch learning in adapting to spammer changes
The system efficiently updates models with minimal computation and memory
Optimal online methods depend on specific feature sets and data dynamics
Abstract
Social networking services like Twitter have been playing an import role in people's daily life since it supports new ways of communicating effectively and sharing information. The advantages of these social network services enable them rapidly growing. However, the rise of social network services is leading to the increase of unwanted, disruptive information from spammers, malware discriminators, and other content polluters. Negative effects of social spammers do not only annoy users, but also lead to financial loss and privacy issues. There are two main challenges of spammer detection on Twitter. Firstly, the data of social network scale with a huge volume of streaming social data. Secondly, spammers continually change their spamming strategy such as changing content patterns or trying to gain social influence, disguise themselves as far as possible. With those challenges, it is hard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Network Security and Intrusion Detection · Misinformation and Its Impacts
