It's TIME: Towards the Next Generation of Time Series Forecasting Benchmarks
Zhongzheng Qiao, Sheng Pan, Anni Wang, Viktoriya Zhukova, Yong Liu, Xudong Jiang, Qingsong Wen, Mingsheng Long, Ming Jin, Chenghao Liu

TL;DR
This paper introduces TIME, a comprehensive, high-quality benchmark with diverse datasets and tasks for evaluating time series foundation models, emphasizing real-world relevance and generalizable insights.
Contribution
It presents a new benchmark with 50 datasets and 98 tasks, rigorous data quality assurance, real-world task formulation, and a pattern-level evaluation approach for better model assessment.
Findings
Evaluated 12 TSFMs on the new benchmark.
Established a multi-granular leaderboard for analysis.
Demonstrated improved insights through pattern-level evaluation.
Abstract
Time series foundation models (TSFMs) are revolutionizing the forecasting landscape from specific dataset modeling to generalizable task evaluation. However, we contend that existing benchmarks exhibit common limitations in four dimensions: constrained data composition dominated by reused legacy sources, compromised data integrity lacking rigorous quality assurance, misaligned task formulations detached from real-world contexts, and rigid analysis perspectives that obscure generalizable insights. To bridge these gaps, we introduce TIME, a next-generation task-centric benchmark comprising 50 fresh datasets and 98 forecasting tasks, tailored for strict zero-shot TSFM evaluation free from data leakage. Integrating large language models and human expertise, we establish a rigorous human-in-the-loop benchmark construction pipeline to ensure high data integrity and redefine task formulation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Time Series Analysis and Forecasting · Machine Learning in Healthcare
