Revisiting Attention for Multivariate Time Series Forecasting
Haixiang Wu

TL;DR
This paper introduces Frequency Spectrum attention (FSatten) and Scaled Orthogonal attention (SOatten), novel mechanisms that improve multivariate time series forecasting by better capturing periodic dependencies and dependencies patterns, outperforming traditional attention methods.
Contribution
It proposes two new attention mechanisms, FSatten and SOatten, based on frequency domain and orthogonal embeddings, enhancing forecasting accuracy over conventional attention.
Findings
FSatten outperforms traditional attention in capturing periodic dependencies.
SOatten surpasses state-of-the-art methods in multivariate time series forecasting.
Both methods are effective alternatives to conventional attention mechanisms.
Abstract
Current Transformer methods for Multivariate Time-Series Forecasting (MTSF) are all based on the conventional attention mechanism. They involve sequence embedding and performing a linear projection of Q, K, and V, and then computing attention within this latent space. We have never delved into the attention mechanism to explore whether such a mapping space is optimal for MTSF. To investigate this issue, this study first proposes Frequency Spectrum attention (FSatten), a novel attention mechanism based on the frequency domain space. It employs the Fourier transform for embedding and introduces Multi-head Spectrum Scaling (MSS) to replace the conventional linear mapping of Q and K. FSatten can accurately capture the periodic dependencies between sequences and outperform the conventional attention without changing mainstream architectures. We further design a more general method dubbed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Forecasting Techniques and Applications · Stock Market Forecasting Methods
MethodsAttention Is All You Need · Residual Connection · Adam · Dropout · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Softmax · Convolution
