Estimating Local Commuting Patterns From Geolocated Twitter Data
Graham McNeill, Jonathan Bright, Scott A. Hale

TL;DR
This paper demonstrates that geolocated Twitter data can effectively estimate local commuting patterns, outperforming existing models, with robustness to demographic biases and potential for incorporating temporal dynamics.
Contribution
It introduces a simple heuristic method using Twitter data for estimating commuting patterns, outperforming the radiation model, and explores sources of error and temporal extensions.
Findings
Twitter-based heuristics outperform the radiation model in estimating commuting patterns.
Model performs better on short trips with higher commuter volumes.
Demographic biases do not significantly affect the Twitter-based measurements.
Abstract
The emergence of large stores of transactional data generated by increasing use of digital devices presents a huge opportunity for policymakers to improve their knowledge of the local environment and thus make more informed and better decisions. A research frontier is hence emerging which involves exploring the type of measures that can be drawn from data stores such as mobile phone logs, Internet searches and contributions to social media platforms, and the extent to which these measures are accurate reflections of the wider population. This paper contributes to this research frontier, by exploring the extent to which local commuting patterns can be estimated from data drawn from Twitter. It makes three contributions in particular. First, it shows that simple heuristics drawn from geolocated Twitter data offer a good proxy for local commuting patterns; one which outperforms the major…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
