Practical Guidelines for Ideology Detection Pipelines and Psychosocial Applications
Rohit Ram, Emma Thomas, David Kernot, Marian-Andrei Rizoiu

TL;DR
This paper offers practical guidelines and a robust pipeline for online ideology detection, addressing data collection challenges, bias quantification, and applying psychosocial theories to large-scale datasets to improve understanding of ideological patterns.
Contribution
It introduces a framework for ideology signal selection, evaluates bias, and demonstrates a high-performance detection pipeline on large datasets, bridging offline psychosocial insights with online applications.
Findings
Right-wing ideologies use more vice-moral language
Greater association of national flags with certain ideologies
Pipeline outperforms state-of-the-art with 0.95 AUC ROC
Abstract
Online ideology detection is crucial for downstream tasks, like countering ideologically motivated violent extremism and modeling opinion dynamics. However, two significant issues arise in practitioners' deployment. Firstly, gold-standard training data is prohibitively labor-intensive to collect and has limited reusability beyond its collection context (i.e., time, location, and platform). Secondly, to circumvent expense, researchers employ ideological signals (such as hashtags shared). Unfortunately, these signals' annotation requirements and context transferability are largely unknown, and the bias they induce remains unquantified. This study provides guidelines for practitioners requiring real-time detection of left, right, and extreme ideologies in large-scale online settings. We propose a framework for pipeline constructions, describing ideology signals by their associated labor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Hate Speech and Cyberbullying Detection · Media Influence and Politics
