What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Shangbin Feng; Herun Wan; Ningnan Wang; Zhaoxuan Tan; Minnan Luo; Yulia Tsvetkov

arXiv:2402.00371·cs.CL·July 8, 2024·2 cites

Open Access · 1 Repo

TL;DR

This paper explores how large language models can strengthen social media bot detection through novel detector frameworks, and how the same models pose risks by enabling manipulation of user information that undermines detection accuracy and system reliability.

Contribution

It introduces a mixture-of-heterogeneous-experts framework for LLM-based bot detection and analyzes the risks of LLM-guided manipulation to evade detection.
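
As a rough illustration of the divide-and-conquer idea, the sketch below gives each information modality its own expert and combines their labels by majority vote. Everything in it is an assumption made for illustration: the three modalities (metadata, text, structure), the toy rules standing in for LLM experts, the field names, and the majority-vote aggregation are not taken from the paper or its repository.

```python
from collections import Counter
from typing import Callable, Dict, List

# Hypothetical expert type: maps a user record to "bot" or "human".
Expert = Callable[[Dict], str]


def metadata_expert(user: Dict) -> str:
    # Toy rule standing in for an LLM prompted with account metadata.
    suspicious = user["followers"] < 10 and user["following"] > 1000
    return "bot" if suspicious else "human"


def text_expert(user: Dict) -> str:
    # Toy rule standing in for an LLM prompted with the user's posts:
    # flag accounts whose posts are mostly duplicates.
    posts = user["posts"]
    return "bot" if len(set(posts)) < len(posts) / 2 else "human"


def structure_expert(user: Dict) -> str:
    # Toy rule standing in for an LLM prompted with neighborhood information.
    return "bot" if user["bot_neighbor_ratio"] > 0.5 else "human"


def majority_vote(user: Dict, experts: List[Expert]) -> str:
    # Each expert votes independently; the most common label wins.
    # With an odd number of experts and two labels there are no ties.
    votes = Counter(expert(user) for expert in experts)
    return votes.most_common(1)[0][0]


EXPERTS = [metadata_expert, text_expert, structure_expert]

example_user = {
    "followers": 3,
    "following": 5000,
    "posts": ["buy now"] * 4,
    "bot_neighbor_ratio": 0.2,
}
prediction = majority_vote(example_user, EXPERTS)
```

Here two of the three toy experts flag the example account, so the combined prediction is "bot" even though the structure expert disagrees.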

Findings

01

Instruction tuning on merely 1,000 annotated examples produces specialized LLM detectors that outperform state-of-the-art baselines by up to 9.1%.

02

LLM-guided manipulation can reduce detection performance by up to 29.6%.

03

Manipulation strategies harm the calibration and reliability of detection systems.
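
To make "calibration" concrete, here is a minimal sketch of expected calibration error (ECE), a common way to measure the gap between a detector's stated confidence and its actual accuracy. It is an assumption that ECE is a suitable metric here; this page does not say which calibration measure the paper uses, and the function name and binning scheme are illustrative only.

```python
from typing import List


def expected_calibration_error(
    confidences: List[float], correct: List[int], n_bins: int = 10
) -> float:
    # confidences[i]: the detector's confidence in its prediction, in [0, 1].
    # correct[i]: 1 if that prediction was right, 0 otherwise.
    n = len(confidences)
    ece = 0.0
    for b in range(n_bins):
        lo, hi = b / n_bins, (b + 1) / n_bins
        # Bins are (lo, hi]; a confidence of exactly 0.0 goes in the first bin.
        idx = [
            i
            for i, c in enumerate(confidences)
            if lo < c <= hi or (b == 0 and c == 0.0)
        ]
        if not idx:
            continue
        avg_conf = sum(confidences[i] for i in idx) / len(idx)
        accuracy = sum(correct[i] for i in idx) / len(idx)
        # Weight each bin's |confidence - accuracy| gap by its share of samples.
        ece += len(idx) / n * abs(avg_conf - accuracy)
    return ece


# Overconfident toy detector: 95% confident on both predictions, right on one.
overconfident = expected_calibration_error([0.95, 0.95], [1, 0])
```

A well-calibrated detector has an ECE near zero; a detector that stays highly confident while being wrong more often, as in the toy case above, scores higher.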

Abstract

Social media bot detection has always been an arms race between advancements in machine learning bot detectors and adversarial bot strategies to evade detection. In this work, we bring the arms race to the next level by investigating the opportunities and risks of state-of-the-art large language models (LLMs) in social bot detection. To investigate the opportunities, we design novel LLM-based bot detectors by proposing a mixture-of-heterogeneous-experts framework to divide and conquer diverse user information modalities. To illuminate the risks, we explore the possibility of LLM-guided manipulation of user textual and structured information to evade detection. Extensive experiments with three LLMs on two datasets demonstrate that instruction tuning on merely 1,000 annotated examples produces specialized LLMs that outperform state-of-the-art baselines by up to 9.1% on both datasets,…

Code & Models

Repositories

bunsenfeng/botsay (PyTorch, official)

Taxonomy

Topics: Spam and Phishing Detection · Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection