Why Model Selection Fails in Time Series Forecasting: An Empirical Study of Instability Across Data Regimes
Tahir Cetin Akinci, Alfredo A. Martinez-Morales

TL;DR
This study empirically demonstrates that simple, rule-based model selection methods in time series forecasting are unreliable due to high performance variability across different data regimes and horizons.
Contribution
It introduces a descriptor-based framework to analyze data regimes and shows that static rules cannot reliably predict the best forecasting models.
Findings
Rule-based model selection achieves low accuracy in identifying optimal models.
Significant discrepancies exist between recommended and empirically optimal models.
Model performance is highly sensitive to data characteristics and forecasting horizon.
Abstract
Time series forecasting models often exhibit inconsistent performance across datasets with varying statistical and structural properties. Despite the wide range of available forecasting techniques, it remains unclear whether model selection can be reliably guided by simple data characteristics. This paper investigates why rule-based model selection fails in time series forecasting by analyzing the relationship between data-regime descriptors and model performance. A descriptor-based framework is introduced to characterize time series using measurable properties, including trend strength, seasonality, noise level, and temporal dependence. Based on these descriptors, a rule-based selection mechanism is formulated to map data regimes to candidate forecasting models. The approach is evaluated on multiple real-world datasets across different domains and forecasting horizons. The results show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
