Harnessing Mobile Phone Social Network Topology to Infer Users Demographic Attributes
Jorge Brea, Javier Burroni, Martin Minnoni, Carlos Sarraute

TL;DR
This paper analyzes mobile phone social network topology in Mexico to predict users' age with high accuracy, leveraging network structure and known user ages to infer demographics at scale.
Contribution
It introduces a novel graph-based algorithm that uses network topology and seed data to accurately predict users' age categories in a large mobile phone network.
Findings
Achieved 62% accuracy in 4-category age prediction, surpassing random guess baseline of 25%.
Enhanced prediction accuracy to 72% using probabilistic inference methods.
Demonstrated the effectiveness of topological metrics in improving demographic inference.
Abstract
We study the structure of the social graph of mobile phone users in the country of Mexico, with a focus on demographic attributes of the users (more specifically the users' age). We examine assortativity patterns in the graph, and observe a strong age homophily in the communications preferences. We propose a graph based algorithm for the prediction of the age of mobile phone users. The algorithm exploits the topology of the mobile phone network, together with a subset of known users ages (seeds), to infer the age of remaining users. We provide the details of the methodology, and show experimental results on a network GT with more than 70 million users. By carefully examining the topological relations of the seeds to the rest of the nodes in GT, we find topological metrics which have a direct influence on the performance of the algorithm. In particular we characterize subsets of users…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
