TL;DR
This paper introduces a method that combines social network graph embeddings and language features to improve the prediction of Twitter users' socioeconomic attributes such as occupation and income, outperforming existing methods.
Contribution
The paper presents a novel approach that integrates social network embeddings with textual data for socioeconomic attribute prediction on Twitter.
Findings
Graph embeddings improve prediction accuracy.
Combining network and language features yields better results.
Method outperforms previous state-of-the-art approaches.
Abstract
Inferring socioeconomic attributes of social media users such as occupation and income is an important problem in computational social science. Automated inference of such characteristics has applications in personalised recommender systems, targeted computational advertising and online political campaigning. While previous work has shown that language features can reliably predict socioeconomic attributes on Twitter, employing information coming from users' social networks has not yet been explored for such complex user characteristics. In this paper, we describe a method for predicting the occupational class and the income of Twitter users given information extracted from their extended networks by learning a low-dimensional vector representation of users, i.e. graph embeddings. We use this representation to train predictive models for occupational class and income. Results on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
