Performance and sensitivities of home detection from mobile phone data
Maarten Vanhoof, Clement Lee, Zbigniew Smoreda

TL;DR
This study evaluates the accuracy and uncertainties of home detection algorithms using large-scale mobile phone data from France, revealing moderate performance and significant effects of observation time and parameters.
Contribution
It provides an extensive empirical validation of nine home detection algorithms at a national scale, highlighting key sources of uncertainty and their impacts.
Findings
Nation-wide home detection correlates with ground truth at 0.60.
Observation time significantly affects detection performance.
Parameter choices have smaller effects compared to other uncertainties.
Abstract
Large-scale location based traces, such as mobile phone data, have been identified as a promising data source to complement or even enrich official statistics. In many cases, a prerequisite step to deploy the massively gathered data is the detection of home location from individual users. The problem is that little research exists on the validation (comparison with ground truth datasets) or the uncertainty estimation of home detection methods, not at individual user level, nor at nation-wide levels. In this paper, we present an extensive empirical analysis of home detection methods when performed on a nation-wide mobile phone dataset from France. We analyze the validity of 9 different Home Detection Algorithms (HDAs), and we assess different sources of uncertainty. Based on 225 different set-ups for the home detection of around 18 million users we discuss different measures for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
