MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
Arkadiusz Modzelewski, Witold Sosnowski, Eleni Papadopulos, Elisa Sartori, Tiziano Labruna, Giovanni Da San Martino, Adam Wierzbicki

TL;DR
This paper introduces MALINT, a new annotated dataset capturing malicious intent behind disinformation, and proposes intent-based inoculation to improve LLMs' ability to detect disinformation across multiple languages.
Contribution
The work presents MALINT, the first human-annotated corpus for malicious disinformation intent, and develops intent-augmented reasoning to enhance LLMs' detection capabilities.
Findings
Intent-augmented reasoning improves zero-shot disinformation detection.
MALINT dataset supports multi-language disinformation research.
Benchmarking shows LLMs benefit from intent analysis.
Abstract
The intentional creation and spread of disinformation poses a significant threat to public discourse. However, existing English datasets and research rarely address the intentionality behind the disinformation. This work presents MALINT, the first human-annotated English corpus developed in collaboration with expert fact-checkers to capture disinformation and its malicious intent. We utilize our novel corpus to benchmark 12 language models, including small language models (SLMs) such as BERT and large language models (LLMs) like Llama 3.3, on binary and multilabel intent classification tasks. Moreover, inspired by inoculation theory from psychology and communication studies, we investigate whether incorporating knowledge of malicious intent can improve disinformation detection. To this end, we propose intent-based inoculation, an intent-augmented reasoning for LLMs that integrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMisinformation and Its Impacts · Deception detection and forensic psychology · Topic Modeling
