Sensorformer: Cross-patch attention with global-patch compression is effective for high-dimensional multivariate time series forecasting
Liyang Qin, Xiaoli Wang, Chunhua Yang, Huaiwen Zou, Haochuan Zhang

TL;DR
Sensorformer introduces a novel Transformer architecture that effectively fuses cross-time and cross-variable dependencies in high-dimensional multivariate time series forecasting by global patch compression and cross-patch attention, outperforming existing methods.
Contribution
It proposes Sensorformer, a new Transformer model that captures inter-variable correlations and causal relationships despite dynamic lags, with reduced computational complexity.
Findings
Sensorformer outperforms existing models on 9 real-world datasets.
It effectively captures dynamic causal relationships between variables.
The model reduces computational complexity compared to pure cross-patch self-attention.
Abstract
Among the existing Transformer-based multivariate time series forecasting methods, iTransformer, which treats each variable sequence as a token and only explicitly extracts cross-variable dependencies, and PatchTST, which adopts a channel-independent strategy and only explicitly extracts cross-time dependencies, both significantly outperform most Channel-Dependent Transformer that simultaneously extract cross-time and cross-variable dependencies. This indicates that existing Transformer-based multivariate time series forecasting methods still struggle to effectively fuse these two types of information. We attribute this issue to the dynamic time lags in the causal relationships between different variables. Therefore, we propose a new multivariate time series forecasting Transformer, Sensorformer, which first compresses the global patch information and then simultaneously extracts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Complex Systems and Time Series Analysis
MethodsAttention Is All You Need · Absolute Position Encodings · Softmax · Linear Layer · Adam · Residual Connection · Dropout · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing
