ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic   Time-Series Transformer

Jiawen Zhang; Shun Zheng; Xumeng Wen; Xiaofang Zhou; Jiang Bian; Jia; Li

arXiv:2411.01842·cs.LG·November 5, 2024

ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer

Jiawen Zhang, Shun Zheng, Xumeng Wen, Xiaofang Zhou, Jiang Bian, Jia, Li

PDF

Open Access 1 Repo 1 Video

TL;DR

ElasTST introduces a versatile, non-autoregressive time-series transformer that effectively handles varied forecasting horizons through innovative design features like structured self-attention, tunable position embeddings, and multi-scale patches.

Contribution

This paper presents ElasTST, a novel elastic transformer architecture for time-series forecasting that maintains invariance across different horizons during inference.

Findings

01

ElasTST outperforms state-of-the-art models in varied-horizon forecasting tasks.

02

The model demonstrates robustness and adaptability across multiple datasets.

03

Experimental results validate the effectiveness of its design components.

Abstract

Numerous industrial sectors necessitate models capable of providing robust forecasts across various horizons. Despite the recent strides in crafting specific architectures for time-series forecasting and developing pre-trained universal models, a comprehensive examination of their capability in accommodating varied-horizon forecasting during inference is still lacking. This paper bridges this gap through the design and evaluation of the Elastic Time-Series Transformer (ElasTST). The ElasTST model incorporates a non-autoregressive design with placeholders and structured self-attention masks, warranting future outputs that are invariant to adjustments in inference horizons. A tunable version of rotary position embedding is also integrated into ElasTST to capture time-series-specific periods and enhance adaptability to different horizons. Additionally, ElasTST employs a multi-scale patch…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/probts
jaxOfficial

Videos

ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer· slideslive

Taxonomy

TopicsAdvanced Image Fusion Techniques

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings