LMBot: Distilling Graph Knowledge into Language Model for Graph-less Deployment in Twitter Bot Detection

Zijian Cai; Zhaoxuan Tan; Zhenyu Lei; Zifeng Zhu; Hongrui Wang; Qinghua Zheng; Minnan Luo

arXiv:2306.17408 · cs.AI · January 4, 2024

TL;DR

LMBot is a novel framework that distills graph neural network knowledge into language models, enabling effective graph-less Twitter bot detection with improved robustness, efficiency, and versatility.

Contribution

The paper introduces LMBot, a method that transfers GNN knowledge into language models for graph-free deployment in Twitter bot detection, addressing data dependency and bias issues.

Findings

1. LMBot achieves state-of-the-art results on four benchmarks.
2. LMBot is more robust and versatile than traditional graph-based methods.
3. LMBot reduces inference time by eliminating the need for graph data.

Abstract

As malicious actors employ increasingly advanced and widespread bots to disseminate misinformation and manipulate public opinion, the detection of Twitter bots has become a crucial task. Though graph-based Twitter bot detection methods achieve state-of-the-art performance, we find that their inference depends on the neighbor users multi-hop away from the targets, and fetching neighbors is time-consuming and may introduce bias. At the same time, we find that after finetuning on Twitter bot detection, pretrained language models achieve competitive performance and do not require a graph structure during deployment. Inspired by this finding, we propose a novel bot detection framework LMBot that distills the knowledge of graph neural networks (GNNs) into language models (LMs) for graph-less deployment in Twitter bot detection to combat the challenge of data dependency. Moreover, LMBot is…
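
The abstract describes distilling GNN knowledge into a language model so that no graph is needed at deployment. The sketch below illustrates generic soft-label knowledge distillation only; it is not taken from the paper or its repository, and LMBot's actual objective may differ. The function names, the temperature, and the `alpha` weighting are all assumptions made for illustration. The teacher logits stand in for a GNN's bot/human prediction for one user, and the student logits stand in for a language model's prediction from that user's text alone.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """Hypothetical combined loss for one user (illustrative only).

    alpha weights the hard-label cross-entropy; (1 - alpha) weights the
    KL term that pulls the student (LM) toward the teacher (GNN).
    """
    # Hard-label term: cross-entropy against the ground-truth class.
    student_probs = softmax(student_logits)
    cross_entropy = -math.log(student_probs[label] + 1e-12)

    # Soft-label term: match the teacher's softened distribution.
    teacher_soft = softmax(teacher_logits, temperature)
    student_soft = softmax(student_logits, temperature)
    soft_term = kl_divergence(teacher_soft, student_soft) * temperature ** 2

    return alpha * cross_entropy + (1 - alpha) * soft_term


# Class 0 = human, class 1 = bot (an assumed label convention).
teacher = [-1.0, 2.0]          # GNN teacher leans "bot"
agreeing_student = [-1.0, 2.0]
disagreeing_student = [2.0, -1.0]

loss_agree = distillation_loss(agreeing_student, teacher, label=1)
loss_disagree = distillation_loss(disagreeing_student, teacher, label=1)
```

In this toy setup the student that agrees with the teacher receives the smaller loss. Once training ends, only the student would be needed at inference time, which is the sense in which deployment is graph-less.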

Code & Models

Repositories

czjdsg/lmbot (PyTorch, official)

Taxonomy

Topics: Misinformation and Its Impacts · Spam and Phishing Detection · Network Security and Intrusion Detection