Identification of Twitter Bots Based on an Explainable Machine Learning Framework: The US 2020 Elections Case Study
Alexander Shevtsov, Christos Tzagkarakis, Despoina Antonakaki, Sotiris, Ioannidis

TL;DR
This paper presents an explainable machine learning framework using XGBoost and SHAP for detecting Twitter bots, especially during elections, demonstrating improved accuracy over existing methods.
Contribution
The study introduces a novel, explainable bot detection system combining XGBoost with SHAP for feature importance, tailored for electoral period Twitter data.
Findings
Outperforms recent state-of-the-art bot detection methods in accuracy.
Uses SHAP for interpretability of ML model predictions.
Effective in identifying malicious bots during election periods.
Abstract
Twitter is one of the most popular social networks attracting millions of users, while a considerable proportion of online discourse is captured. It provides a simple usage framework with short messages and an efficient application programming interface (API) enabling the research community to study and analyze several aspects of this social network. However, the Twitter usage simplicity can lead to malicious handling by various bots. The malicious handling phenomenon expands in online discourse, especially during the electoral periods, where except the legitimate bots used for dissemination and communication purposes, the goal is to manipulate the public opinion and the electorate towards a certain direction, specific ideology, or political party. This paper focuses on the design of a novel system for identifying Twitter bots based on labeled Twitter data. To this end, a supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Network Security and Intrusion Detection
