Unified Training of Universal Time Series Forecasting Transformers
Gerald Woo, Chenghao Liu, Akshat Kumar, Caiming Xiong, Silvio, Savarese, Doyen Sahoo

TL;DR
This paper introduces Moirai, a universal time series forecasting Transformer trained on a large dataset, capable of handling diverse tasks without dataset-specific tuning, outperforming traditional models.
Contribution
The paper presents Moirai, a novel Transformer architecture for universal time series forecasting, trained on LOTSA, addressing cross-frequency, multivariate, and distributional challenges.
Findings
Moirai achieves competitive zero-shot forecasting performance.
Trained on LOTSA with over 27 billion observations.
Outperforms traditional models on diverse datasets.
Abstract
Deep learning for time series forecasting has traditionally operated within a one-model-per-dataset framework, limiting its potential to leverage the game-changing impact of large pre-trained models. The concept of universal forecasting, emerging from pre-training on a vast collection of time series datasets, envisions a single Large Time Series Model capable of addressing diverse downstream forecasting tasks. However, constructing such a model poses unique challenges specific to time series data: i) cross-frequency learning, ii) accommodating an arbitrary number of variates for multivariate time series, and iii) addressing the varying distributional properties inherent in large-scale data. To address these challenges, we present novel enhancements to the conventional time series Transformer architecture, resulting in our proposed Masked Encoder-based Universal Time Series Forecasting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google/timesfm-2.0-500m-pytorchmodel· 30k dl· ♡ 24530k dl♡ 245
- 🤗Salesforce/moirai-2.0-R-smallmodel· 608k dl· ♡ 38608k dl♡ 38
- 🤗Salesforce/moirai-1.0-R-smallmodel· 6.4k dl· ♡ 306.4k dl♡ 30
- 🤗Salesforce/moirai-1.0-R-basemodel· 2.4k dl· ♡ 302.4k dl♡ 30
- 🤗Salesforce/moirai-1.0-R-largemodel· 16k dl· ♡ 8416k dl♡ 84
- 🤗Pranavv/moirai-basemodel
- 🤗Pranavv/moirai-largemodel
- 🤗sktime/moirai-1.0-R-smallmodel
- 🤗sktime/moirai-1.0-R-basemodel
- 🤗sktime/moirai-1.0-R-largemodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Neural Networks and Applications · Stock Market Forecasting Methods
MethodsAttention Is All You Need · Absolute Position Encodings · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Adam · Residual Connection · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer
