Unified Training of Universal Time Series Forecasting Transformers

Gerald Woo; Chenghao Liu; Akshat Kumar; Caiming Xiong; Silvio Savarese; Doyen Sahoo

arXiv:2402.02592 · cs.LG · May 24, 2024 · 32 cites

Open Access · 1 Repo · 10 Models · 3 Datasets

TL;DR

This paper introduces Moirai, a universal time series forecasting Transformer trained on LOTSA, a dataset of over 27 billion observations. It handles diverse forecasting tasks without dataset-specific tuning and is reported to outperform traditional models.

Contribution

The paper presents Moirai, a novel Transformer architecture for universal time series forecasting, trained on LOTSA, addressing cross-frequency, multivariate, and distributional challenges.

Findings

01. Moirai achieves competitive zero-shot forecasting performance.

02. Moirai is trained on LOTSA, which contains over 27 billion observations.

03. Moirai outperforms traditional models on diverse datasets.

Abstract

Deep learning for time series forecasting has traditionally operated within a one-model-per-dataset framework, limiting its potential to leverage the game-changing impact of large pre-trained models. The concept of universal forecasting, emerging from pre-training on a vast collection of time series datasets, envisions a single Large Time Series Model capable of addressing diverse downstream forecasting tasks. However, constructing such a model poses unique challenges specific to time series data: i) cross-frequency learning, ii) accommodating an arbitrary number of variates for multivariate time series, and iii) addressing the varying distributional properties inherent in large-scale data. To address these challenges, we present novel enhancements to the conventional time series Transformer architecture, resulting in our proposed Masked Encoder-based Universal Time Series Forecasting…
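Challenge (ii) above can be made concrete with a toy sketch. This is not the paper's implementation: the abstract is cut off here before it describes the method. The sketch assumes one plausible approach, in which each variate is cut into fixed-size patches and all patches are placed in a single flat sequence, tagged with variate and time indices so that a sequence model can accept any number of variates. The name `flatten_variates` and the token layout are hypothetical.

```python
from typing import List, Tuple

# Hypothetical token layout: (variate_id, time_id, patch_values)
Patch = Tuple[int, int, List[float]]


def flatten_variates(series: List[List[float]], patch_size: int) -> List[Patch]:
    """Split each variate into non-overlapping patches and emit one flat sequence.

    Trailing observations that do not fill a whole patch are dropped for simplicity.
    """
    if patch_size <= 0:
        raise ValueError("patch_size must be positive")
    tokens: List[Patch] = []
    for variate_id, values in enumerate(series):
        num_patches = len(values) // patch_size
        for time_id in range(num_patches):
            start = time_id * patch_size
            tokens.append((variate_id, time_id, values[start:start + patch_size]))
    return tokens


# Two variates of different lengths map to a single token sequence.
tokens = flatten_variates([[1.0, 2.0, 3.0, 4.0], [10.0, 20.0]], patch_size=2)
print(len(tokens))  # 3
```

Because the sequence length is the only thing that changes with the number of variates, the same downstream model could in principle consume univariate and multivariate inputs alike. How to choose `patch_size` per sampling frequency is left open here.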

Code & Models

Repositories

SalesforceAIResearch/uni2ts
PyTorch · Official

Taxonomy

Topics: Time Series Analysis and Forecasting · Neural Networks and Applications · Stock Market Forecasting Methods

Methods: Attention Is All You Need · Absolute Position Encodings · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Adam · Residual Connection · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer