Parallel Spatio-Temporal Attention-Based TCN for Multivariate Time Series Prediction
Fan Jin, Ke Zhang, Yipan Huang, Yifei Zhu, Baiping Chen

TL;DR
This paper introduces PSTA-TCN, a novel framework combining parallel spatio-temporal attention with stacked TCNs, significantly improving long-horizon multivariate time series prediction in terms of accuracy and training efficiency.
Contribution
The paper presents a new framework that integrates parallel spatio-temporal attention with TCNs, addressing limitations of RNNs for long-term forecasting in complex multivariate data.
Findings
Dramatic reduction in training times due to parallel computation
Substantial increase in prediction accuracy for long horizons
Prediction windows up to 13 times longer than existing methods
Abstract
As industrial systems become more complex and monitoring sensors for everything from surveillance to our health become more ubiquitous, multivariate time series prediction is taking an important place in the smooth-running of our society. A recurrent neural network with attention to help extend the prediction windows is the current-state-of-the-art for this task. However, we argue that their vanishing gradients, short memories, and serial architecture make RNNs fundamentally unsuited to long-horizon forecasting with complex data. Temporal convolutional networks (TCNs) do not suffer from gradient problems and they support parallel calculations, making them a more appropriate choice. Additionally, they have longer memories than RNNs, albeit with some instability and efficiency problems. Hence, we propose a framework, called PSTA-TCN, that combines a parallel spatio-temporal attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
