ConspEmoLLM-v2: A robust and stable model to detect sentiment-transformed conspiracy theories

Zhiwei Liu; Paul Thompson; Jiaqi Rong; Sophia Ananiadou

arXiv:2505.14917·cs.CL·October 28, 2025

ConspEmoLLM-v2: A robust and stable model to detect sentiment-transformed conspiracy theories

Zhiwei Liu, Paul Thompson, Jiaqi Rong, Sophia Ananiadou

PDF

Open Access 1 Repo

TL;DR

This paper introduces ConspEmoLLM-v2, a model designed to detect sentiment-altered conspiracy theories, improving robustness against disguised misinformation by training on an augmented dataset with sentiment-reduced LLM-rewritten content.

Contribution

The paper presents an enhanced conspiracy detection model, ConspEmoLLM-v2, trained on ConDID-v2, which includes LLM-rewritten content to improve detection of sentiment-disguised conspiracy theories.

Findings

01

ConspEmoLLM-v2 outperforms previous models on sentiment-transformed conspiracy content.

02

The augmented dataset improves model robustness against sentiment disguise.

03

Experimental results show comparable or better performance on original data.

Abstract

Despite the many benefits of large language models (LLMs), they can also cause harm, e.g., through automatic generation of misinformation, including conspiracy theories. Moreover, LLMs can also ''disguise'' conspiracy theories by altering characteristic textual features, e.g., by transforming their typically strong negative emotions into a more positive tone. Although several studies have proposed automated conspiracy theory detection methods, they are usually trained using human-authored text, whose features can vary from LLM-generated text. Furthermore, several conspiracy detection models, including the previously proposed ConspEmoLLM, rely heavily on the typical emotional features of human-authored conspiracy content. As such, intentionally disguised content may evade detection. To combat such issues, we firstly developed an augmented version of the ConDID conspiracy detection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lzw108/conspemollm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Sentiment Analysis and Opinion Mining · Spam and Phishing Detection