Evaluating AI capabilities in detecting conspiracy theories on YouTube
Leonardo La Rocca, Francesco Corso, Francesco Pierri

TL;DR
This paper evaluates the effectiveness of large language models in detecting conspiracy theory videos on YouTube, highlighting their strengths in recall and limitations in precision, with implications for online content moderation.
Contribution
It compares open-weight LLMs and a fine-tuned RoBERTa baseline for conspiracy detection, providing insights into their performance and limitations in real-world scenarios.
Findings
Text-based LLMs have high recall but lower precision.
Multimodal models show limited benefits over text-only models.
RoBERTa performs nearly as well as larger LLMs in real-world tests.
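To make the first finding concrete, the sketch below computes precision and recall from binary labels. The labels are hypothetical toy data, not results from the paper. They are chosen to show how a classifier that flags every conspiracy video (perfect recall) can still have low precision when it also flags many benign videos.

```python
def precision_recall(y_true, y_pred):
    """Return (precision, recall) for binary labels, where 1 = conspiracy."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    # Guard against division by zero when nothing is predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical toy data: 3 conspiracy videos, 7 benign videos.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
# The classifier catches all 3 conspiracy videos but also flags 3 benign ones.
y_pred = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0]

precision, recall = precision_recall(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case recall is 1.0 while precision is 0.5: half of the flagged videos are false positives, which is the kind of trade-off the finding describes.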
Abstract
As a leading online platform with a vast global audience, YouTube is also susceptible to hosting harmful content, including disinformation and conspiracy theories. This study explores the use of open-weight Large Language Models (LLMs), both text-only and multimodal, for identifying conspiracy theory videos shared on YouTube. Leveraging a labeled dataset of thousands of videos, we evaluate a variety of LLMs in a zero-shot setting and compare their performance to a fine-tuned RoBERTa baseline. Results show that text-based LLMs achieve high recall but lower precision, leading to increased false positives. Multimodal models lag behind their text-only counterparts, indicating limited benefits from visual data integration. To assess real-world applicability, we evaluate the most accurate models on an unlabeled dataset, finding that RoBERTa achieves performance close…
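The zero-shot setting mentioned in the abstract can be sketched roughly as follows: the model receives an instruction plus the video's text and returns a label, with no task-specific training. Everything here is an assumption made for illustration. The prompt wording, the choice of title and description as inputs, and the names `build_prompt`, `parse_label`, `classify`, and `fake_generate` are hypothetical and do not come from the paper. `fake_generate` is a keyword stub standing in for a real LLM call.

```python
from typing import Callable

# Hypothetical prompt; the paper's actual prompt is not given in this summary.
PROMPT_TEMPLATE = (
    "You are a content moderation assistant.\n"
    "Decide whether the following YouTube video promotes a conspiracy theory.\n"
    "Title: {title}\n"
    "Description: {description}\n"
    "Answer with exactly one word: YES or NO."
)


def build_prompt(title: str, description: str) -> str:
    """Fill the zero-shot template with the video's text fields."""
    return PROMPT_TEMPLATE.format(title=title, description=description)


def parse_label(response: str) -> int:
    """Map a free-text model reply to 1 (conspiracy) or 0 (not conspiracy)."""
    return 1 if response.strip().upper().startswith("YES") else 0


def classify(title: str, description: str, generate: Callable[[str], str]) -> int:
    """Classify one video using any text-in, text-out model function."""
    return parse_label(generate(build_prompt(title, description)))


def fake_generate(prompt: str) -> str:
    """Stub model for demonstration only: answers YES on a keyword match."""
    return "YES" if "flat earth" in prompt.lower() else "NO"


label_a = classify("The flat earth truth", "What they hide from you", fake_generate)
label_b = classify("Sourdough basics", "A beginner bread recipe", fake_generate)
print(label_a, label_b)
```

Passing the model as a plain callable keeps the sketch independent of any particular inference library; a real setup would replace `fake_generate` with a call to the chosen open-weight model.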
Taxonomy
Topics: Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection · Spam and Phishing Detection
Methods: Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Dropout · Residual Connection
