Evaluating AI capabilities in detecting conspiracy theories on YouTube

Leonardo La Rocca; Francesco Corso; Francesco Pierri

arXiv:2505.23570 · cs.CL · July 8, 2025

TL;DR

This paper evaluates the effectiveness of large language models in detecting conspiracy theory videos on YouTube, highlighting their strengths in recall and limitations in precision, with implications for online content moderation.

Contribution

It compares open-weight LLMs and a fine-tuned RoBERTa baseline for conspiracy detection, providing insights into their performance and limitations in real-world scenarios.

Findings

01

Text-based LLMs have high recall but lower precision.

02

Multimodal models show limited benefits over text-only models.

03

RoBERTa performs nearly as well as larger LLMs in real-world tests.

Abstract

As a leading online platform with a vast global audience, YouTube's extensive reach also makes it susceptible to hosting harmful content, including disinformation and conspiracy theories. This study explores the use of open-weight Large Language Models (LLMs), both text-only and multimodal, for identifying conspiracy theory videos shared on YouTube. Leveraging a labeled dataset of thousands of videos, we evaluate a variety of LLMs in a zero-shot setting and compare their performance to a fine-tuned RoBERTa baseline. Results show that text-based LLMs achieve high recall but lower precision, leading to increased false positives. Multimodal models lag behind their text-only counterparts, indicating limited benefits from visual data integration. To assess real-world applicability, we evaluate the most accurate models on an unlabeled dataset, finding that RoBERTa achieves performance close…
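The high-recall, lower-precision pattern described above can be made concrete with a small sketch. The function name `precision_recall` and the toy labels are illustrative assumptions, not taken from the paper or its repository; the example only shows how a classifier that flags every true conspiracy video, plus some benign ones, ends up with perfect recall and reduced precision.

```python
def precision_recall(y_true, y_pred):
    """Return (precision, recall) for binary labels, where 1 = conspiracy."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # correctly flagged
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # benign but flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # conspiracy but missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical toy labels (not data from the paper):
# 3 conspiracy videos and 5 benign ones; the classifier flags 5 videos in total.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 1, 1, 0, 0, 0]

precision, recall = precision_recall(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case no conspiracy video is missed, so recall is 1.0, while the two false positives pull precision down to 0.6. That is the trade-off the abstract attributes to text-based LLMs in the zero-shot setting.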

Code & Models

Repositories

leoli51/youtube-conspiracy-detection (official)

Taxonomy

Topics: Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection · Spam and Phishing Detection

Methods: Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Dropout · Residual Connection