TL;DR
This paper introduces STHD, a scalable Transformer model designed to improve high-dimensional multivariate time series forecasting by addressing noise and training challenges in channel-dependent models.
Contribution
The paper proposes STHD, a novel scalable Transformer architecture with relation matrix sparsity, a reindexing training strategy, and 2-D input handling to effectively forecast high-dimensional MTS data.
Findings
STHD outperforms existing models on three high-dimensional datasets.
The relation matrix sparsity reduces noise and memory usage.
ReIndex enhances training flexibility and data diversity.
Abstract
Deep models for Multivariate Time Series (MTS) forecasting have recently demonstrated significant success. Channel-dependent models capture complex dependencies that channel-independent models cannot capture. However, the number of channels in real-world applications outpaces the capabilities of existing channel-dependent models, and contrary to common expectations, some models underperform the channel-independent models in handling high-dimensional data, which raises questions about the performance of channel-dependent models. To address this, our study first investigates the reasons behind the suboptimal performance of these channel-dependent models on high-dimensional MTS data. Our analysis reveals that two primary issues lie in the introduced noise from unrelated series that increases the difficulty of capturing the crucial inter-channel dependencies, and challenges in training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Softmax · Absolute Position Encodings · Dense Connections
