A Streaming Machine Learning Framework for Online Aggression Detection on Twitter
Herodotos Herodotou, Despoina Chatzakou, Nicolas Kourtellis

TL;DR
This paper presents a real-time streaming machine learning framework for detecting online aggression on Twitter, capable of adapting to evolving behavior and scaling efficiently to massive data streams with high accuracy.
Contribution
It introduces the first practical streaming ML framework for Twitter aggression detection that adapts incrementally and scales to the full Twitter Firehose.
Findings
Achieves over 90% accuracy, precision, and recall.
Scales to 778 million tweets per day with only 3 machines.
Can detect sarcasm, racism, and sexism in real time.
Abstract
The rise of online aggression on social media is evolving into a major point of concern. Several machine and deep learning approaches have been proposed recently for detecting various types of aggressive behavior. However, social media are fast paced, generating an increasing amount of content, while aggressive behavior evolves over time. In this work, we introduce the first, practical, real-time framework for detecting aggression on Twitter via embracing the streaming machine learning paradigm. Our method adapts its ML classifiers in an incremental fashion as it receives new annotated examples and is able to achieve the same (or even higher) performance as batch-based ML models, with over 90% accuracy, precision, and recall. At the same time, our experimental analysis on real Twitter data reveals how our framework can easily scale to accommodate the entire Twitter Firehose (of 778…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
