Identifying Hidden Visits from Sparse Call Detail Record Data
Zhan Zhao, Haris N. Koutsopoulos, Jinhua Zhao

TL;DR
This paper introduces a data fusion method to detect hidden visits in sparse call detail record data, significantly improving mobility inference accuracy using machine learning models on a large real-world dataset.
Contribution
It presents a novel data fusion approach combining granular cellular data and communication records to identify hidden visits, addressing a key limitation in CDR-based mobility analysis.
Findings
Over 10% of displacements involve hidden visits.
Significant improvement over naive no-hidden-visit rule.
Effective use of machine learning models for inference.
Abstract
Despite a large body of literature on trip inference using call detail record (CDR) data, a fundamental understanding of their limitations is lacking. In particular, because of the sparse nature of CDR data, users may travel to a location without being revealed in the data, which we refer to as a "hidden visit". The existence of hidden visits hinders our ability to extract reliable information about human mobility and travel behavior from CDR data. In this study, we propose a data fusion approach to obtain labeled data for statistical inference of hidden visits. In the absence of complementary data, this can be accomplished by extracting labeled observations from more granular cellular data access records, and extracting features from voice call and text messaging records. The proposed approach is demonstrated using a real-world CDR dataset of 3 million users from a large Chinese city.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Data-Driven Disease Surveillance · Wireless Communication Networks Research
