LMBot: Distilling Graph Knowledge into Language Model for Graph-less Deployment in Twitter Bot Detection
Zijian Cai, Zhaoxuan Tan, Zhenyu Lei, Zifeng Zhu, Hongrui Wang,, Qinghua Zheng, Minnan Luo

TL;DR
LMBot is a novel framework that distills graph neural network knowledge into language models, enabling effective graph-less Twitter bot detection with improved robustness, efficiency, and versatility.
Contribution
The paper introduces LMBot, a method that transfers GNN knowledge into language models for graph-free deployment in Twitter bot detection, addressing data dependency and bias issues.
Findings
LMBot achieves state-of-the-art results on four benchmarks.
LMBot is more robust and versatile than traditional graph-based methods.
LMBot reduces inference time by eliminating the need for graph data.
Abstract
As malicious actors employ increasingly advanced and widespread bots to disseminate misinformation and manipulate public opinion, the detection of Twitter bots has become a crucial task. Though graph-based Twitter bot detection methods achieve state-of-the-art performance, we find that their inference depends on the neighbor users multi-hop away from the targets, and fetching neighbors is time-consuming and may introduce bias. At the same time, we find that after finetuning on Twitter bot detection, pretrained language models achieve competitive performance and do not require a graph structure during deployment. Inspired by this finding, we propose a novel bot detection framework LMBot that distills the knowledge of graph neural networks (GNNs) into language models (LMs) for graph-less deployment in Twitter bot detection to combat the challenge of data dependency. Moreover, LMBot is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Network Security and Intrusion Detection
