TL;DR
This paper introduces TIMME, a multi-task multi-relational embedding model designed to predict political ideology from Twitter data, effectively handling data sparsity and heterogeneity, and outperforming existing models.
Contribution
The paper presents a novel embedding model, TIMME, tailored for ideology detection on Twitter, addressing challenges of incomplete labels and heterogeneous data.
Findings
Links can predict ideology effectively without text
Conservative voices are under-represented on Twitter
Follow relation is most predictive of ideology
Abstract
We aim at solving the problem of predicting people's ideology, or political tendency. We estimate it by using Twitter data, and formalize it as a classification problem. Ideology-detection has long been a challenging yet important problem. Certain groups, such as the policy makers, rely on it to make wise decisions. Back in the old days when labor-intensive survey-studies were needed to collect public opinions, analyzing ordinary citizens' political tendencies was uneasy. The rise of social medias, such as Twitter, has enabled us to gather ordinary citizen's data easily. However, the incompleteness of the labels and the features in social network datasets is tricky, not to mention the enormous data size and the heterogeneousity. The data differ dramatically from many commonly-used datasets, thus brings unique challenges. In our work, first we built our own datasets from Twitter. Next,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
