Detecting Polarized Topics Using Partisanship-aware Contextualized Topic Embeddings
Zihao He, Negar Mokhberian, Antonio Camara, Andres Abeliuk, Kristina, Lerman

TL;DR
This paper introduces PaCTE, a novel method that uses partisanship-aware contextualized embeddings to automatically detect and measure polarized topics in partisan news sources, demonstrated on COVID-19 news articles.
Contribution
We propose PaCTE, a new approach that leverages fine-tuned language models and cosine distance to quantify topic polarization from partisan news data.
Findings
Effective in identifying highly polarized topics
Demonstrates robustness across different news sources
Outperforms baseline methods in polarization detection
Abstract
Growing polarization of the news media has been blamed for fanning disagreement, controversy and even violence. Early identification of polarized topics is thus an urgent matter that can help mitigate conflict. However, accurate measurement of topic-wise polarization is still an open research challenge. To address this gap, we propose Partisanship-aware Contextualized Topic Embeddings (PaCTE), a method to automatically detect polarized topics from partisan news sources. Specifically, utilizing a language model that has been finetuned on recognizing partisanship of the news articles, we represent the ideology of a news corpus on a topic by corpus-contextualized topic embedding and measure the polarization using cosine distance. We apply our method to a dataset of news articles about the COVID-19 pandemic. Extensive experiments on different news sources and topics demonstrate the efficacy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Media and Politics · Topic Modeling · Computational and Text Analysis Methods
