Online Human-Bot Interactions: Detection, Estimation, and   Characterization

Onur Varol; Emilio Ferrara; Clayton A. Davis; Filippo Menczer,; Alessandro Flammini

arXiv:1703.03107·cs.SI·March 28, 2017·238 cites

Online Human-Bot Interactions: Detection, Estimation, and Characterization

Onur Varol, Emilio Ferrara, Clayton A. Davis, Filippo Menczer,, Alessandro Flammini

PDF

Open Access 4 Repos

TL;DR

This paper presents a comprehensive framework for detecting and characterizing social bots on Twitter using a wide range of features, achieving high accuracy and revealing insights into bot behaviors and prevalence.

Contribution

The study introduces a novel detection framework leveraging over a thousand features, validated on diverse datasets, and provides detailed characterization of different bot types and their interaction patterns.

Findings

01

Bots constitute 9-15% of active Twitter accounts.

02

Simple bots tend to interact with more human-like bots.

03

Different bot subclasses exhibit distinct content and interaction strategies.

Abstract

Increasing evidence suggests that a growing amount of social media content is generated by autonomous entities known as social bots. In this work we present a framework to detect such entities on Twitter. We leverage more than a thousand features extracted from public data and meta-data about users: friends, tweet content and sentiment, network patterns, and activity time series. We benchmark the classification framework by using a publicly available dataset of Twitter bots. This training data is enriched by a manually annotated collection of active Twitter users that include both humans and bots of varying sophistication. Our models yield high accuracy and agreement with each other and can detect bots of different nature. Our estimates suggest that between 9% and 15% of active Twitter accounts are bots. Characterizing ties among accounts, we observe that simple bots tend to interact…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Complex Network Analysis Techniques