TwiBot-22: Towards Graph-Based Twitter Bot Detection
Shangbin Feng, Zhaoxuan Tan, Herun Wan, Ningnan Wang, Zilong Chen,, Binchi Zhang, Qinghua Zheng, Wenqian Zhang, Zhenyu Lei, Shujie Yang, Xinshun, Feng, Qingyue Zhang, Hongrui Wang, Yuhan Liu, Yuyang Bai, Heng Wang, Zijian, Cai, Yanbo Wang, Lijing Zheng, Zihan Ma, Jundong Li

TL;DR
TwiBot-22 introduces the largest, most comprehensive graph-based Twitter bot detection dataset and evaluation framework, enabling improved research and benchmarking in the field.
Contribution
It provides a large-scale, high-quality, graph-based Twitter bot detection dataset and a unified evaluation framework, addressing limitations of previous datasets and facilitating fair comparison of detection methods.
Findings
Re-implemented 35 bot detection baselines for comparison.
Evaluated models on 9 datasets including TwiBot-22.
Demonstrated improved detection performance with the new dataset.
Abstract
Twitter bot detection has become an increasingly important task to combat misinformation, facilitate social media moderation, and preserve the integrity of the online discourse. State-of-the-art bot detection methods generally leverage the graph structure of the Twitter network, and they exhibit promising performance when confronting novel Twitter bots that traditional methods fail to detect. However, very few of the existing Twitter bot detection datasets are graph-based, and even these few graph-based datasets suffer from limited dataset scale, incomplete graph structure, as well as low annotation quality. In fact, the lack of a large-scale graph-based Twitter bot detection benchmark that addresses these issues has seriously hindered the development and evaluation of novel graph-based bot detection approaches. In this paper, we propose TwiBot-22, a comprehensive graph-based Twitter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Internet Traffic Analysis and Secure E-voting
