Analysis of Propaganda in Tweets From Politically Biased Sources
Vivek Sharma, Mohammad Mahdi Shokri, Sarah Ita Levitan, Elena Filatova, Shweta Jain

TL;DR
This paper introduces JMBX, a dataset of tweets from journalists with varying political biases, and analyzes how bias influences propaganda language use, comparing LLMs and BERT in detection performance and discussing associated costs.
Contribution
It presents a new dataset of journalist tweets with bias annotations and evaluates the effectiveness and costs of different LLMs for propaganda detection.
Findings
Journalists from extreme bias outlets use more propaganda language.
LLMs outperform BERT-based models in propaganda detection.
Large language models entail significant monetary and environmental costs.
Abstract
News outlets are well known to have political associations, and many national outlets cultivate political biases to cater to different audiences. Journalists working for these news outlets have a big impact on the stories they cover. In this work, we present a methodology to analyze the role of journalists, affiliated with popular news outlets, in propagating their bias using some form of propaganda-like language. We introduce JMBX(Journalist Media Bias on X), a systematically collected and annotated dataset of 1874 tweets from Twitter (now known as X). These tweets are authored by popular journalists from 10 news outlets whose political biases range from extreme left to extreme right. We extract several insights from the data and conclude that journalists who are affiliated with outlets with extreme biases are more likely to use propaganda-like language in their writings compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
