Rethinking Spatio-Temporal Transformer for Traffic Prediction:Multi-level Multi-view Augmented Learning Framework
Jiaqi Lin, Qianqian Ren

TL;DR
This paper introduces LVSTformer, a novel multi-level multi-view spatio-temporal transformer that effectively captures complex spatial and temporal dependencies for traffic prediction, achieving state-of-the-art results on multiple benchmarks.
Contribution
The paper proposes a multi-view augmented transformer with multi-level spatial and temporal modeling, enhancing traffic prediction accuracy and robustness.
Findings
Achieves up to 4.32% improvement over baselines.
Effectively captures multi-level spatial dependencies.
Enhances model robustness and generalization.
Abstract
Traffic prediction is a challenging spatio-temporal forecasting problem that involves highly complex spatio-temporal correlations. This paper proposes a Multi-level Multi-view Augmented Spatio-temporal Transformer (LVSTformer) for traffic prediction. The model aims to capture spatial dependencies from three different levels: local geographic, global semantic, and pivotal nodes, along with long- and short-term temporal dependencies. Specifically, we design three spatial augmented views to delve into the spatial information from the perspectives of local, global, and pivotal nodes. By combining three spatial augmented views with three parallel spatial self-attention mechanisms, the model can comprehensively captures spatial dependencies at different levels. We design a gated temporal self-attention mechanism to effectively capture long- and short-term temporal dependencies. Furthermore, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Time Series Analysis and Forecasting · Data Visualization and Analytics
MethodsSparse Evolutionary Training · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Linear Layer · Multi-Head Attention
