Comparing hundreds of machine learning classifiers and discrete choice models in predicting travel behavior: an empirical benchmark
Shenhao Wang, Baichuan Mo, Yunhan Zheng, Stephane Hess, Jinhua Zhao

TL;DR
This study develops an empirical benchmark to compare machine learning and discrete choice models in travel demand prediction, revealing the importance of data context and inherent uncertainty in model performance.
Contribution
It introduces a formal statistical benchmark for model comparison, accounting for randomness and contextual effects, and highlights the significance of data sources and transferability.
Findings
ML models, especially ensemble and deep learning, often outperform DCMs.
Contextual factors significantly influence model performance.
Inherent randomness persists even after controlling for model and data factors.
Abstract
Numerous studies have compared machine learning (ML) and discrete choice models (DCMs) in predicting travel demand. However, these studies often lack generalizability as they compare models deterministically without considering contextual variations. To address this limitation, our study develops an empirical benchmark by designing a tournament model, thus efficiently summarizing a large number of experiments, quantifying the randomness in model comparisons, and using formal statistical tests to differentiate between the model and contextual effects. This benchmark study compares two large-scale data sources: a database compiled from literature review summarizing 136 experiments from 35 studies, and our own experiment data, encompassing a total of 6,970 experiments from 105 models and 12 model families. This benchmark study yields two key findings. Firstly, many ML models, particularly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation Planning and Optimization · Traffic Prediction and Management Techniques · Economic and Environmental Valuation
MethodsEmirates Airlines Office in Dubai
