Fusing Cellular Network Data and Tollbooth Counts for Urban Traffic Flow Estimation
Oluwaleke Yusuf, Shaira Tabassum

TL;DR
This paper presents a machine learning framework that combines cellular network data and tollbooth counts to generate detailed, vehicle-specific origin-destination traffic flow estimates for urban planning.
Contribution
It introduces a novel method to disaggregate cellular mobility data using sparse tollbooth counts, enabling more accurate traffic flow estimation in data-scarce environments.
Findings
The framework successfully infers hourly OD matrices by vehicle category.
Limited tollbooth data can correct extensive cellular mobility data.
The approach is applicable to urban traffic planning in data-scarce contexts.
Abstract
Traffic simulations, essential for planning urban transit infrastructure interventions, require vehicle-category-specific origin-destination (OD) data. Existing data sources are imperfect: sparse tollbooth sensors provide accurate vehicle counts by category, while extensive mobility data from cellular network activity captures aggregated crowd movement, but lack modal disaggregation and have systematic biases. This study develops a machine learning framework to correct and disaggregate cellular network data using sparse tollbooth counts as ground truth. The model uses temporal and spatial features to learn the complex relationship between aggregated mobility data and vehicular data. The framework infers destinations from transit routes and implements routing logic to distribute corrected flows between OD pairs. This approach is applied to a bus depot expansion in Trondheim, Norway,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
