Spatiotemporal Tile-based Attention-guided LSTMs for Traffic Video Prediction
Tu Nguyen

TL;DR
This paper introduces a tile-aware, attention-guided Conv-LSTM model for traffic video prediction that captures fine-grained and coarse spatial structures while maintaining long-term temporal dependencies, demonstrating improved scalability and accuracy.
Contribution
The paper proposes a novel spatiotemporal tile-based attention mechanism integrated with Conv-LSTM, enabling scalable and accurate traffic prediction on large maps with theoretical analysis.
Findings
Enhanced scalability for large-scale traffic data
Competitive forecasting accuracy on Traffic4Cast dataset
Theoretical bounds on stability and memory-accuracy tradeoffs
Abstract
This extended abstract describes our solution for the Traffic4Cast Challenge 2019. The task requires modeling both fine-grained (pixel-level) and coarse (region-level) spatial structure while preserving temporal relationships across long sequences. Building on Conv-LSTM ideas, we introduce a tile-aware, cascaded-memory Conv-LSTM augmented with cross-frame additive attention and a memory-flexible training scheme: frames are sampled per spatial tile so the model learns tile-local dynamics and per-tile memory cells can be updated sparsely, paged, or compressed to scale to large maps. We provide a compact theoretical analysis (tight softmax/attention Lipschitz bound and a tiling error lower bound) explaining stability and the memory-accuracy tradeoffs, and empirically demonstrate improved scalability and competitive forecasting performance on large-scale traffic heatmaps.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
