Trajectory Test-Train Overlap in Next-Location Prediction Datasets
Massimiliano Luca, Luca Pappalardo, Bruno Lepri, Gianni Barlacchi

TL;DR
This paper investigates the extent of trajectory overlap in next-location prediction datasets, revealing a memorization issue and proposing a reranking method that significantly enhances generalization, especially for unseen trajectories.
Contribution
It uncovers the prevalence of trajectory overlap in datasets and introduces a reranking approach to improve predictor generalization in next-location prediction.
Findings
High trajectory overlap limits predictor generalization.
Proposed reranking improves accuracy up to 96.15% on low-overlap data.
Predictors tend to memorize trajectories rather than generalize.
Abstract
Next-location prediction, consisting of forecasting a user's location given their historical trajectories, has important implications in several fields, such as urban planning, geo-marketing, and disease spreading. Several predictors have been proposed in the last few years to address it, including last-generation ones based on deep learning. This paper tests the generalization capability of these predictors on public mobility datasets, stratifying the datasets by whether the trajectories in the test set also appear fully or partially in the training set. We consistently discover a severe problem of trajectory overlapping in all analyzed datasets, highlighting that predictors memorize trajectories while having limited generalization capacities. We thus propose a methodology to rerank the outputs of the next-location predictors based on spatial mobility patterns. With these techniques,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Data Management and Algorithms · Geographic Information Systems Studies
