Toto 2.0: Time Series Forecasting Enters the Scaling Era
Emaad Khwaja, Chris Lettieri, Gerald Woo, Eden Belouadah, Marc Cenac, Guillaume Jarry, Enguerrand Paquin, Xunyi Zhao, Viktoriya Zhukov, Othmane Abou-Amal, Chenghao Liu, Ameet Talwalkar, David Asker

TL;DR
Toto 2.0 demonstrates that time series foundation models improve forecast quality with scale, releasing open models that set new benchmarks across multiple datasets.
Contribution
The paper introduces Toto 2.0, a scalable family of time series models trained with a unified recipe, achieving state-of-the-art results and providing open access to all checkpoints.
Findings
Models scale reliably from 4M to 2.5B parameters.
Toto 2.0 achieves new state-of-the-art on BOOM, GIFT-Eval, and TIME benchmarks.
Open-source release of all five base checkpoints.
Abstract
We show that time series foundation models scale: a single training recipe produces reliable forecast-quality improvements from 4M to 2.5B parameters. We release Toto 2.0, a family of five open-weights forecasting models trained under this recipe. The Toto 2.0 family sets a new state of the art on three forecasting benchmarks: BOOM, our observability benchmark; GIFT-Eval, the standard general-purpose benchmark; and the recent contamination-resistant TIME benchmark. This report describes our experimental results and details the design decisions behind Toto 2.0: its architecture and training recipe, training data, and the u-muP hyperparameter transfer pipeline. All five base checkpoints are released under Apache 2.0.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Datadog/Toto-2.0-4mmodel· 2.7k dl· ♡ 162.7k dl♡ 16
- 🤗Datadog/Toto-2.0-22mmodel· 8.2k dl· ♡ 128.2k dl♡ 12
- 🤗Datadog/Toto-2.0-313mmodel· 6.6k dl· ♡ 216.6k dl♡ 21
- 🤗Datadog/Toto-2.0-1Bmodel· 3.2k dl· ♡ 153.2k dl♡ 15
- 🤗Datadog/Toto-2.0-2.5Bmodel· 6.9k dl· ♡ 446.9k dl♡ 44
- 🤗Datadog/Toto-2.0-Family-and-Friendsmodel· ♡ 3♡ 3
- 🤗Datadog/Toto-2.0-2.5B-FTmodel· 110 dl· ♡ 1110 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
