Impermanent: A Live Benchmark for Temporal Generalization in Time Series Forecasting
Azul Garza, Ren\'ee Rosillo, Rodrigo Mendoza-Smith, David Salinas, Andrew Robert Williams, Arjun Ashok, Mononito Goswami, Jos\'e Mart\'in Ju\'arez

TL;DR
Impermanent introduces a live, continuously updated benchmark for evaluating the temporal robustness and generalization of time series forecasting models using real-world, non-stationary data streams from GitHub activity.
Contribution
It presents a novel live benchmarking framework that assesses models over time, addressing limitations of static test sets and enabling ongoing evaluation of temporal generalization.
Findings
Evaluates models on non-stationary GitHub data streams.
Provides a platform for ongoing, reproducible model comparison.
Highlights challenges in maintaining performance over evolving data.
Abstract
Recent advances in time-series forecasting increasingly rely on pre-trained foundation-style models. While these models often claim broad generalization, existing evaluation protocols provide limited evidence. Indeed, most current benchmarks use static train-test splits that can easily lead to contamination as foundation models can inadvertently train on test data or perform model selection using test scores, which can inflate performance. We introduce Impermanent, a live benchmark that evaluates forecasting models under open-world temporal change by scoring forecasts sequentially over time on continuously updated data streams, enabling the study of temporal robustness, distributional shift, and performance stability rather than one-off accuracy on a frozen test set. Impermanent is instantiated on GitHub open-source activity, providing a naturally live and highly non-stationary dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Time Series Analysis and Forecasting · Traffic Prediction and Management Techniques
