Mining large-scale human mobility data for long-term crime prediction
Cristina Kadar, Irena Pletikosa

TL;DR
This study demonstrates that large-scale human mobility data significantly enhances long-term crime prediction models in urban areas, outperforming traditional census-based approaches by capturing dynamic human activity patterns.
Contribution
The paper introduces a novel approach using diverse mobility data sources and machine learning to improve crime prediction accuracy at the city level.
Findings
Mobility features improve R^2 up to 65% geographically and 89% temporally.
Human dynamics features are most predictive for grand larcenies.
Census features suffice for predicting assaults.
Abstract
Traditional crime prediction models based on census data are limited, as they fail to capture the complexity and dynamics of human activity. With the rise of ubiquitous computing, there is the opportunity to improve such models with data that make for better proxies of human presence in cities. In this paper, we leverage large human mobility data to craft an extensive set of features for crime prediction, as informed by theories in criminology and urban studies. We employ averaging and boosting ensemble techniques from machine learning, to investigate their power in predicting yearly counts for different types of crimes occurring in New York City at census tract level. Our study shows that spatial and spatio-temporal features derived from Foursquare venues and checkins, subway rides, and taxi rides, improve the baseline models relying on census and POI data. The proposed models achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
