On Designing Day Ahead and Same Day Ridership Level Prediction Models for City-Scale Transit Networks Using Noisy APC Data
Jose Paolo Talusan (1), Ayan Mukhopadhyay (1), Dan Freudberg (2),, Abhishek Dubey (1) ((1) Vanderbilt University, (2) Nashville Metropolitan, Transit Authority)

TL;DR
This paper develops machine learning models using multi-source, large-scale noisy data to accurately predict transit ridership at trip and stop levels, aiding resource allocation and passenger planning.
Contribution
It introduces a data fusion approach combining transit, weather, traffic, and calendar data for ridership prediction, with models trained on 17 million observations for improved accuracy.
Findings
Xgboost trip model outperforms baseline
LSTM stop model outperforms baseline
Models achieve high accuracy across entire service day
Abstract
The ability to accurately predict public transit ridership demand benefits passengers and transit agencies. Agencies will be able to reallocate buses to handle under or over-utilized bus routes, improving resource utilization, and passengers will be able to adjust and plan their schedules to avoid overcrowded buses and maintain a certain level of comfort. However, accurately predicting occupancy is a non-trivial task. Various reasons such as heterogeneity, evolving ridership patterns, exogenous events like weather, and other stochastic variables, make the task much more challenging. With the progress of big data, transit authorities now have access to real-time passenger occupancy information for their vehicles. The amount of data generated is staggering. While there is no shortage in data, it must still be cleaned, processed, augmented, and merged before any useful information can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Human Mobility and Location-Based Analysis · Transportation Planning and Optimization
Methodstravel james · Tanh Activation · Sigmoid Activation · Long Short-Term Memory
