Parallel Spatio-Temporal Attention-Based TCN for Multivariate Time   Series Prediction

Fan Jin; Ke Zhang; Yipan Huang; Yifei Zhu; Baiping Chen

arXiv:2203.00971·cs.LG·March 3, 2022

Parallel Spatio-Temporal Attention-Based TCN for Multivariate Time Series Prediction

Fan Jin, Ke Zhang, Yipan Huang, Yifei Zhu, Baiping Chen

PDF

TL;DR

This paper introduces PSTA-TCN, a novel framework combining parallel spatio-temporal attention with stacked TCNs, significantly improving long-horizon multivariate time series prediction in terms of accuracy and training efficiency.

Contribution

The paper presents a new framework that integrates parallel spatio-temporal attention with TCNs, addressing limitations of RNNs for long-term forecasting in complex multivariate data.

Findings

01

Dramatic reduction in training times due to parallel computation

02

Substantial increase in prediction accuracy for long horizons

03

Prediction windows up to 13 times longer than existing methods

Abstract

As industrial systems become more complex and monitoring sensors for everything from surveillance to our health become more ubiquitous, multivariate time series prediction is taking an important place in the smooth-running of our society. A recurrent neural network with attention to help extend the prediction windows is the current-state-of-the-art for this task. However, we argue that their vanishing gradients, short memories, and serial architecture make RNNs fundamentally unsuited to long-horizon forecasting with complex data. Temporal convolutional networks (TCNs) do not suffer from gradient problems and they support parallel calculations, making them a more appropriate choice. Additionally, they have longer memories than RNNs, albeit with some instability and efficiency problems. Hence, we propose a framework, called PSTA-TCN, that combines a parallel spatio-temporal attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.