Cherry-Picking in Time Series Forecasting: How to Select Datasets to Make Your Model Shine
Luis Roque, Carlos Soares, Vitor Cerqueira, Luis Torgo

TL;DR
This study investigates how dataset selection bias, especially cherry-picking, impacts the evaluation of time series forecasting methods, revealing that limited datasets can misrepresent method performance and emphasizing the need for comprehensive testing.
Contribution
It highlights the effects of dataset selection bias on forecasting method evaluation and proposes that increasing dataset diversity improves reliability of performance assessments.
Findings
Cherry-picking datasets can exaggerate method effectiveness.
Using only four datasets, 46% of methods appear top-performing.
Expanding from 3 to 6 datasets reduces incorrect best-method identification by 40%.
Abstract
The importance of time series forecasting drives continuous research and the development of new approaches to tackle this problem. Typically, these methods are introduced through empirical studies that frequently claim superior accuracy for the proposed approaches. Nevertheless, concerns are rising about the reliability and generalizability of these results due to limitations in experimental setups. This paper addresses a critical limitation: the number and representativeness of the datasets used. We investigate the impact of dataset selection bias, particularly the practice of cherry-picking datasets, on the performance evaluation of forecasting methods. Through empirical analysis with a diverse set of benchmark datasets, our findings reveal that cherry-picking datasets can significantly distort the perceived performance of methods, often exaggerating their effectiveness. Furthermore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Forecasting Techniques and Applications · Stock Market Forecasting Methods
MethodsSparse Evolutionary Training
