A Systematic Evaluation of Generative Models on Tabular Transportation Data
Chengen Wang, Alvaro Cardenas, Gurcan Comert, Murat Kantarcioglu

TL;DR
This paper systematically evaluates various generative models for creating synthetic transportation data, highlighting their limitations and proposing new metrics to better assess their structural and privacy-preserving qualities.
Contribution
It introduces a novel graph-based metric and an improved privacy metric tailored for transportation data, and provides a comprehensive evaluation of existing models on NYC taxi data.
Findings
Existing models often underperform on transportation data.
The new graph metric reveals significant structural gaps.
Current models may not ensure adequate privacy protection.
Abstract
The sharing of large-scale transportation data is beneficial for transportation planning and policymaking. However, it also raises significant security and privacy concerns, as the data may include identifiable personal information, such as individuals' home locations. To address these concerns, synthetic data generation based on real transportation data offers a promising solution that allows privacy protection while potentially preserving data utility. Although there are various synthetic data generation techniques, they are often not tailored to the unique characteristics of transportation data, such as the inherent structure of transportation networks formed by all trips in the datasets. In this paper, we use New York City taxi data as a case study to conduct a systematic evaluation of the performance of widely used tabular data generative models. In addition to traditional metrics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Transportation Planning and Optimization
