Traffic4cast -- Large-scale Traffic Prediction using 3DResNet and Sparse-UNet
Bo Wang, Reza Mohajerpoor, Chen Cai, Inhi Kim, Hai L. Vu

TL;DR
This paper presents novel deep learning models, 3DResNet and Sparse-UNet, for large-scale city traffic prediction, demonstrating improved accuracy and efficiency over baselines in the Traffic4cast 2021 competition.
Contribution
Introduces 3DResNet and Sparse-UNet models for spatiotemporal traffic prediction, emphasizing efficiency and generalizability to new cities.
Findings
Both models outperform baseline algorithms.
Sparse-UNet reduces computation time significantly.
Models achieve competitive accuracy in traffic prediction.
Abstract
The IARAI competition Traffic4cast 2021 aims to predict short-term city-wide high-resolution traffic states given the static and dynamic traffic information obtained previously. The aim is to build a machine learning model for predicting the normalized average traffic speed and flow of the subregions of multiple large-scale cities using historical data points. The model is supposed to be generic, in a way that it can be applied to new cities. By considering spatiotemporal feature learning and modeling efficiency, we explore 3DResNet and Sparse-UNet approaches for the tasks in this competition. The 3DResNet based models use 3D convolution to learn the spatiotemporal features and apply sequential convolutional layers to enhance the temporal relationship of the outputs. The Sparse-UNet model uses sparse convolutions as the backbone for spatiotemporal feature learning. Since the latter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Anomaly Detection Techniques and Applications · Traffic and Road Safety
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · 3D Convolution · Convolution · Sparse Convolutions
