BotHash: Efficient and Training-Free Bot Detection Through Approximate Nearest Neighbor
Edoardo Di Paolo, Fabio De Gaspari, Angelo Spognardi

TL;DR
BotHash is a training-free social bot detection method that uses approximate nearest neighbor search on simplified user representations, effectively identifying bots even with advanced LLM-generated content, and requires minimal training data.
Contribution
It introduces BotHash, a novel, training-free approach for social bot detection that simplifies user representation and leverages approximate nearest neighbor search, bypassing deep learning complexities.
Findings
Effective differentiation between human and bot accounts.
Robust performance with minimal ground-truth data.
Early detection capabilities across datasets.
Abstract
Online Social Networks (OSNs) are a cornerstone in modern society, serving as platforms for diverse content consumption by millions of users each day. However, the challenge of ensuring the accuracy of information shared on these platforms remains significant, especially with the widespread dissemination of disinformation. Social bots -- automated accounts designed to mimic human behavior, frequently spreading misinformation -- represent one of the critical problems of OSNs. The advent of Large Language Models (LLMs) has further complicated bot behaviors, making detection increasingly difficult. This paper presents BotHash, an innovative, training-free approach to social bot detection. BotHash leverages a simplified user representation that enables approximate nearest-neighbor search to detect bots, avoiding the complexities of Deep-Learning model training and large dataset creation. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications
