Correcting socioeconomic bias in mobile phone mobility estimates using multilevel regression and poststratification
Leo Ferres, Laetitia Gauvin

TL;DR
This paper applies multilevel regression and poststratification (MRP) to correct socioeconomic bias in mobile phone mobility data, improving the accuracy of population mobility estimates from biased CDR samples.
Contribution
It introduces the use of MRP for CDR-based mobility studies, demonstrating bias correction using socioeconomic and geographic data in a real-world Chilean dataset.
Findings
MRP reduces the radius of gyration estimate by about 17%.
A geographic-only model still captures much of the socioeconomic bias.
MRP provides a principled correction for non-representative CDR data.
Abstract
Call detail records (CDR) from mobile phone networks are widely used to study human mobility however CDR data from a single mobile operator are inherently biased because the observed users do not mirror the population distribution. Using data from a major Chilean carrier in Santiago, we observe the user base is skewed by socioeconomic group, so aggregate metrics like radius of gyration are distorted by the population that is actually observed. To correct this sampling bias, we apply multilevel regression and poststratification (MRP), a method that is not yet standard for CDR-based mobility studies. We fit a Bayesian multilevel model for individual mobility using socioeconomic status, gender, and geography, with partial pooling across comunas, and then poststratify the predictions to match census demographics. This approach reduces the naive CDR estimate of average radius of gyration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
