TL;DR
This paper investigates how well user demographics can be predicted from their app usage data, revealing that certain demographics like gender are highly predictable, and analyzing factors affecting prediction accuracy.
Contribution
It extends previous work by predicting additional demographics, comparing dimensionality reduction methods, and analyzing the impact of app count and training size on predictability.
Findings
Gender prediction accuracy is 82.3%.
Income prediction accuracy is 60.3%.
Prediction accuracy varies with number of apps and training data size.
Abstract
Understanding the demographics of app users is crucial, for example, for app developers, who wish to target their advertisements more effectively. Our work addresses this need by studying the predictability of user demographics based on the list of a user's apps which is readily available to many app developers. We extend previous work on the problem on three frontiers: (1) We predict new demographics (age, race, and income) and analyze the most informative apps for four demographic attributes included in our analysis. The most predictable attribute is gender (82.3 % accuracy), whereas the hardest to predict is income (60.3 % accuracy). (2) We compare several dimensionality reduction methods for high-dimensional app data, finding out that an unsupervised method yields superior results compared to aggregating the apps at the app category level, but the best results are obtained simply by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
