Simplistic Collection and Labeling Practices Limit the Utility of   Benchmark Datasets for Twitter Bot Detection

Chris Hays; Zachary Schutzman; Manish Raghavan; Erin Walk; Philipp; Zimmer

arXiv:2301.07015·cs.LG·May 2, 2023·5 cites

Simplistic Collection and Labeling Practices Limit the Utility of Benchmark Datasets for Twitter Bot Detection

Chris Hays, Zachary Schutzman, Manish Raghavan, Erin Walk, Philipp, Zimmer

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that the high performance of existing Twitter bot detection datasets is largely due to simplistic collection and labeling practices, which limit their generalizability and reliability for research and practical use.

Contribution

It reveals that current datasets' collection and labeling methods inflate performance metrics, highlighting the need for more robust and transparent practices in bot detection research.

Findings

01

Simple decision rules perform nearly as well as complex models on existing datasets.

02

Datasets do not generalize well to out-of-sample data, indicating overfitting.

03

Performance heavily depends on dataset collection and labeling procedures.

Abstract

Accurate bot detection is necessary for the safety and integrity of online platforms. It is also crucial for research on the influence of bots in elections, the spread of misinformation, and financial market manipulation. Platforms deploy infrastructure to flag or remove automated accounts, but their tools and data are not publicly available. Thus, the public must rely on third-party bot detection. These tools employ machine learning and often achieve near perfect performance for classification on existing datasets, suggesting bot detection is accurate, reliable and fit for use in downstream applications. We provide evidence that this is not the case and show that high performance is attributable to limitations in dataset collection and labeling rather than sophistication of the tools. Specifically, we show that simple decision rules -- shallow decision trees trained on a small number…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

johnchrishays/bot-detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Internet Traffic Analysis and Secure E-voting