Online learning for Social Spammer Detection on Twitter

Phuc Tri Nguyen; Hideaki Takeda

arXiv:1605.04374 · cs.SI · May 17, 2016

TL;DR

This paper explores online learning techniques for detecting social spammers on Twitter, addressing challenges of high data volume and evolving spammer strategies by enabling real-time model updates.

Contribution

It introduces an online learning framework for spammer detection on Twitter, demonstrating its efficiency over traditional batch methods and analyzing feature set effectiveness.

Findings

1. Online learning outperforms batch learning in adapting to spammer changes

2. The system efficiently updates models with minimal computation and memory

3. Optimal online methods depend on specific feature sets and data dynamics
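
To make the idea of a per-example model update concrete, here is a minimal sketch of an online linear classifier in Python. It uses a plain perceptron as a stand-in; this page does not say which online algorithms the paper evaluates, so the choice of perceptron is an assumption for illustration only. The class name `OnlinePerceptron`, the helper `run_stream`, and the feature names (`url_ratio`, `followers_ratio`, `hashtag_ratio`) are all hypothetical and do not come from the paper.

```python
from typing import Dict, Iterable, Tuple

# A sparse feature vector: feature name -> value (names are hypothetical).
Features = Dict[str, float]


class OnlinePerceptron:
    """Mistake-driven linear classifier updated one example at a time.

    Illustrative stand-in for an online learner; not the paper's method.
    Labels are +1 (spammer) and -1 (legitimate user).
    """

    def __init__(self) -> None:
        self.weights: Dict[str, float] = {}
        self.bias = 0.0

    def score(self, x: Features) -> float:
        # Unseen features contribute zero, so new features can appear mid-stream.
        return self.bias + sum(self.weights.get(k, 0.0) * v for k, v in x.items())

    def predict(self, x: Features) -> int:
        return 1 if self.score(x) > 0 else -1

    def update(self, x: Features, y: int) -> bool:
        """Process one labeled example; return True if the model was wrong."""
        if y * self.score(x) > 0:
            return False  # correct prediction: no change, no extra cost
        for k, v in x.items():
            self.weights[k] = self.weights.get(k, 0.0) + y * v
        self.bias += y
        return True


def run_stream(model: OnlinePerceptron,
               stream: Iterable[Tuple[Features, int]]) -> int:
    """Feed examples in arrival order and count the mistakes made."""
    return sum(1 for x, y in stream if model.update(x, y))


if __name__ == "__main__":
    spam = {"url_ratio": 1.0, "followers_ratio": 0.0}
    ham = {"url_ratio": 0.0, "followers_ratio": 1.0}
    model = OnlinePerceptron()
    print("mistakes:", run_stream(model, [(spam, 1), (ham, -1), (spam, 1), (ham, -1)]))
    # A spammer that switches tactics triggers a single cheap update.
    drifted = {"url_ratio": 0.0, "followers_ratio": 0.0, "hashtag_ratio": 1.0}
    model.update(drifted, 1)
    print("drifted account flagged:", model.predict(drifted) == 1)
```

Each update touches only the features present in the current example and keeps no history of past examples, which is the property that makes this family of methods attractive for high-volume streams. A batch learner would instead need to retrain on stored data to absorb the same change.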

Abstract

Social networking services like Twitter play an important role in people's daily lives because they support new ways of communicating effectively and sharing information. These advantages have enabled such services to grow rapidly. However, the rise of social networking services has also increased the amount of unwanted, disruptive information from spammers, malware disseminators, and other content polluters. Social spammers do not merely annoy users; they also cause financial loss and privacy problems. Spammer detection on Twitter faces two main challenges. First, social network data arrives as a high-volume stream. Second, spammers continually change their strategies, for example by altering content patterns or trying to gain social influence, in order to disguise themselves as far as possible. With those challenges, it is hard…

Taxonomy

Topics: Spam and Phishing Detection · Network Security and Intrusion Detection · Misinformation and Its Impacts