A set of R packages to estimate population counts from mobile phone data
Bogdan Oancea, David Salgado, Luis Sanguiao Sande, Sandra Barragan

TL;DR
This paper introduces a set of R packages that collectively estimate population counts from mobile phone data, integrating geolocation, deduplication, aggregation, and inference to support official statistics.
Contribution
The paper presents a novel software framework with four R packages that operationalize the estimation of population counts from mobile phone data for official statistics.
Findings
Software stack enables population estimation from mobile data
Packages facilitate data deduplication and aggregation
Integrated approach improves population count accuracy
Abstract
In this paper, we describe the software implementation of the methodological framework designed to incorporate mobile phone data into the current production chain of official statistics during the ESSnet Big Data II project. We present an overview of the architecture of the software stack, its components, the interfaces between them, and show how they can be used. Our software implementation consists in four R packages: destim for estimation of the spatial distribution of the mobile devices, deduplication for classification of the devices as being in 1:1 or 2:1 correspondence with its owner, aggregation for estimation of the number of individuals detected by the network starting from the geolocation probabilities and the duplicity probabilities and inference which combines the number of individuals provided by the previous package with other information like the population counts from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Opportunistic and Delay-Tolerant Networks · Human Mobility and Location-Based Analysis
