HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting
Shibo Feng, Peilin Zhao, Liu Liu, Pengcheng Wu, Zhiqi Shen

TL;DR
The paper introduces HDT, a hierarchical discrete transformer that models high-dimensional multivariate time series using discrete token representations, enabling scalable and long-term forecasting with improved accuracy.
Contribution
It proposes a novel hierarchical discrete transformer framework that models multivariate time series as discrete tokens, addressing scalability and long-term forecasting challenges.
Findings
HDT outperforms existing models on five MTS datasets.
The discrete token approach enables faster inference.
Hierarchical modeling extends forecasting length effectively.
Abstract
Generative models have gained significant attention in multivariate time series forecasting (MTS), particularly due to their ability to generate high-fidelity samples. Forecasting the probability distribution of multivariate time series is a challenging yet practical task. Although some recent attempts have been made to handle this task, two major challenges persist: 1) some existing generative methods underperform in high-dimensional multivariate time series forecasting, which is hard to scale to higher dimensions; 2) the inherent high-dimensional multivariate attributes constrain the forecasting lengths of existing generative models. In this paper, we point out that discrete token representations can model high-dimensional MTS with faster inference time, and forecasting the target with long-term trends of itself can extend the forecasting length with high accuracy. Motivated by this,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTime Series Analysis and Forecasting
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Softmax · Absolute Position Encodings · Dropout · Label Smoothing · Byte Pair Encoding
