Large-scale Gender/Age Prediction of Tumblr Users
Yao Zhan, Changwei Hu, Yifan Hu, Tejaswi Kasturi, Shanmugam Ramasamy, Matt Gillingham, Keith Yamamoto

TL;DR
This paper develops graph-based and deep learning models to predict Tumblr users' age and gender from content and social network data, significantly improving demographic inference accuracy for targeted advertising.
Contribution
It introduces novel graph and deep learning approaches for large-scale demographic prediction using rich user content and social connections on Tumblr.
Findings
Achieved 81% relative improvement in age prediction accuracy.
Improved gender prediction AUC and accuracy by 5%.
Validated models on a dataset with hundreds of millions of users.
Abstract
Tumblr, as a leading content provider and social media platform, attracts 371 million monthly visits and hosts 280 million blogs and 53.3 million daily posts. The popularity of Tumblr provides great opportunities for advertisers to promote their products through sponsored posts. However, targeting specific demographic groups for ads is challenging, since Tumblr does not require user information such as gender and age at registration. Hence, to improve ad targeting, it is essential to predict users' demographics using rich content such as posts, images and social connections. In this paper, we propose graph-based and deep learning models for age and gender prediction, which take into account user activities and content features. For graph-based models, we come up with two approaches, network embedding and label propagation, to generate connection features as well as directly infer user's…
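The abstract names label propagation as one of the graph-based approaches, but the excerpt is cut off before any formulation appears. As a rough illustration only, the following is a minimal sketch of a generic label propagation scheme on an undirected follow graph, not the authors' method. The function name `propagate_labels`, the binary score encoding (1.0 for one class, 0.0 for the other, 0.5 for unknown), and the toy graph are all assumptions made for this example.

```python
from collections import defaultdict


def propagate_labels(edges, seed_labels, num_iters=10):
    """Generic label propagation sketch (hypothetical helper, not from the paper).

    edges: iterable of (u, v) pairs, treated as undirected connections.
    seed_labels: dict mapping known users to a score in [0, 1].
    Returns a dict mapping every user to a propagated score.
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    # Unlabeled users start at 0.5 (no evidence either way).
    scores = {node: 0.5 for node in neighbors}
    scores.update(seed_labels)

    for _ in range(num_iters):
        updated = {}
        for node, score in scores.items():
            nbrs = neighbors.get(node, ())
            if node in seed_labels or not nbrs:
                # Known labels stay clamped; isolated users keep their score.
                updated[node] = score
            else:
                # An unlabeled user takes the mean score of its neighbors.
                updated[node] = sum(scores[n] for n in nbrs) / len(nbrs)
        scores = updated
    return scores


# Toy chain a - b - c - d - e with labels known only at the two ends.
toy_edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e")]
toy_seeds = {"a": 1.0, "e": 0.0}
scores = propagate_labels(toy_edges, toy_seeds)
```

In this toy chain, users closer to the seed labeled 1.0 end up with scores above 0.5 and users closer to the seed labeled 0.0 end up below it. A real deployment at the scale the abstract describes would need a distributed implementation and a choice of edge direction and weighting, none of which this sketch addresses.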
Taxonomy
Topics: Recommender Systems and Techniques · Epigenetics and DNA Methylation · Human Mobility and Location-Based Analysis
