mdok-style at SemEval-2026 Task 10: Finetuning LLMs for Conspiracy Detection

Dominik Macko

arXiv:2605.02712·cs.CL·May 5, 2026

mdok-style at SemEval-2026 Task 10: Finetuning LLMs for Conspiracy Detection

Dominik Macko

PDF

TL;DR

This paper presents a system that fine-tunes a large language model with data augmentation and self-training to detect conspiracy beliefs in Reddit comments, achieving competitive results in SemEval-2026 Task 10.

Contribution

It introduces a novel application of machine-generated text detection techniques to conspiracy detection, demonstrating effectiveness with limited training data.

Findings

01

Achieved 85th percentile ranking in the competition.

02

Utilized data augmentation and self-training for fine-tuning.

03

Proved the approach's applicability to conspiracy detection.

Abstract

SemEval-2026 Task 10 is focused on conspiracy detection. Specifically, the goal is to detect whether a Reddit comment expresses a conspiracy belief. Our submitted mdok-style system utilizes data augmentation and self-training (to cope with a rather small amount of training data) to finetune the Qwen3-32B model for a binary text-classification task. The submitted system is very competitive, ranking in the 85th percentile (8th out of 52 submissions). The results shown that our approach, which originated in machine-generated text detection, can be used for conspiracy detection as well.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.