ConspEmoLLM-v2: A robust and stable model to detect sentiment-transformed conspiracy theories
Zhiwei Liu, Paul Thompson, Jiaqi Rong, Sophia Ananiadou

TL;DR
This paper introduces ConspEmoLLM-v2, a model designed to detect sentiment-altered conspiracy theories, improving robustness against disguised misinformation by training on an augmented dataset with sentiment-reduced LLM-rewritten content.
Contribution
The paper presents an enhanced conspiracy detection model, ConspEmoLLM-v2, trained on ConDID-v2, which includes LLM-rewritten content to improve detection of sentiment-disguised conspiracy theories.
Findings
ConspEmoLLM-v2 outperforms previous models on sentiment-transformed conspiracy content.
The augmented dataset improves model robustness against sentiment disguise.
Experimental results show comparable or better performance on original data.
Abstract
Despite the many benefits of large language models (LLMs), they can also cause harm, e.g., through automatic generation of misinformation, including conspiracy theories. Moreover, LLMs can also ''disguise'' conspiracy theories by altering characteristic textual features, e.g., by transforming their typically strong negative emotions into a more positive tone. Although several studies have proposed automated conspiracy theory detection methods, they are usually trained using human-authored text, whose features can vary from LLM-generated text. Furthermore, several conspiracy detection models, including the previously proposed ConspEmoLLM, rely heavily on the typical emotional features of human-authored conspiracy content. As such, intentionally disguised content may evade detection. To combat such issues, we firstly developed an augmented version of the ConDID conspiracy detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Sentiment Analysis and Opinion Mining · Spam and Phishing Detection
