HDT: Hierarchical Discrete Transformer for Multivariate Time Series   Forecasting

Shibo Feng; Peilin Zhao; Liu Liu; Pengcheng Wu; Zhiqi Shen

arXiv:2502.08302·cs.LG·February 13, 2025

HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting

Shibo Feng, Peilin Zhao, Liu Liu, Pengcheng Wu, Zhiqi Shen

PDF

Open Access 1 Repo 1 Video

TL;DR

The paper introduces HDT, a hierarchical discrete transformer that models high-dimensional multivariate time series using discrete token representations, enabling scalable and long-term forecasting with improved accuracy.

Contribution

It proposes a novel hierarchical discrete transformer framework that models multivariate time series as discrete tokens, addressing scalability and long-term forecasting challenges.

Findings

01

HDT outperforms existing models on five MTS datasets.

02

The discrete token approach enables faster inference.

03

Hierarchical modeling extends forecasting length effectively.

Abstract

Generative models have gained significant attention in multivariate time series forecasting (MTS), particularly due to their ability to generate high-fidelity samples. Forecasting the probability distribution of multivariate time series is a challenging yet practical task. Although some recent attempts have been made to handle this task, two major challenges persist: 1) some existing generative methods underperform in high-dimensional multivariate time series forecasting, which is hard to scale to higher dimensions; 2) the inherent high-dimensional multivariate attributes constrain the forecasting lengths of existing generative models. In this paper, we point out that discrete token representations can model high-dimensional MTS with faster inference time, and forecasting the target with long-term trends of itself can extend the forecasting length with high accuracy. Motivated by this,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hdtkk/HDT
pytorchOfficial

Videos

HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting· underline

Taxonomy

TopicsTime Series Analysis and Forecasting

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Softmax · Absolute Position Encodings · Dropout · Label Smoothing · Byte Pair Encoding