Multi-modal Identification of State-Sponsored Propaganda on Social Media
Xiaobo Guo, Soroush Vosoughi

TL;DR
This paper introduces a balanced multi-modal dataset and a visual-textual framework for detecting state-sponsored propaganda on social media, achieving high accuracy and interpretability, and establishing a benchmark for future research.
Contribution
It is the first to create a balanced dataset and develop a multi-model framework for propaganda detection based on visual and textual content.
Findings
Achieved F1=0.869 on same time period detection
Achieved F1=0.697 on cross-time detection
Provided interpretability tools for model explanations
Abstract
The prevalence of state-sponsored propaganda on the Internet has become a cause for concern in the recent years. While much effort has been made to identify state-sponsored Internet propaganda, the problem remains far from being solved because the ambiguous definition of propaganda leads to unreliable data labelling, and the huge amount of potential predictive features causes the models to be inexplicable. This paper is the first attempt to build a balanced dataset for this task. The dataset is comprised of propaganda by three different organizations across two time periods. A multi-model framework for detecting propaganda messages solely based on the visual and textual content is proposed which achieves a promising performance on detecting propaganda by the three organizations both for the same time period (training and testing on data from the same time period) (F1=0.869) and for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
