OSM+: Billion-Level OpenStreetMap Dataset for City-wide Experiments
Guanjie Zheng, Ziyang Su, Yiheng Wang, Yuhang Luo, Hongwei Zhang, Xuanhe Zhou, Linghe Kong, Fan Wu, Wen Ling

TL;DR
OSM+ is a billion-vertex global road network dataset derived from OpenStreetMap, enabling scalable city-wide experiments in traffic prediction, boundary detection, and policy control.
Contribution
The paper introduces OSM+, a large-scale, accessible, and structured road network dataset with tools for multimodal data integration, addressing scalability and benchmarking gaps in graph learning.
Findings
OSM+ contains 1 billion vertices, enabling large-scale city experiments.
Constructed new benchmarks for traffic prediction across 31 cities.
Released datasets for multi-agent traffic policy control in six cities.
Abstract
Road network data provides rich information about cities, but processing worldwide OpenStreetMap (OSM) data is computationally intensive, and the resulting graphs are often difficult to unify for benchmarking downstream tasks. Existing graph learning benchmarks fail to capture the billion-scale and unique topological properties of real-world road networks, leaving model scalability underexplored. To close this gap, we process OSM data with distributed cloud computing using 5,000 cores and release \textbf{OSM+}, a structured worldwide 1-billion-vertex road network graph dataset designed for high accessibility and usability. OSM+ is open source and globally downloadable, providing an open-box graph structure and an easy spatial query interface; the evaluated release is a fixed snapshot for reproducibility, with a versioned update plan for future releases. We demonstrate the utility of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Automated Road and Building Extraction · Data Management and Algorithms
