Random Noise vs State-of-the-Art Probabilistic Forecasting Methods : A Case Study on CRPS-Sum Discrimination Ability
Alireza Koochali, Peter Schichtel, Andreas Dengel, Sheraz Ahmed

TL;DR
This paper critically evaluates the CRPS-sum metric for multivariate probabilistic forecasting, revealing its limitations and potential to misjudge model performance, especially when comparing complex models to simple noise-based ones.
Contribution
It systematically analyzes CRPS-sum's discrimination ability, identifies its flaws, and demonstrates how it can misleadingly favor random noise models over advanced methods.
Findings
CRPS-sum's performance depends on data properties.
CRPS-sum overlooks per-dimension performance.
Dummy noise models can outperform complex models under CRPS-sum.
Abstract
The recent developments in the machine learning domain have enabled the development of complex multivariate probabilistic forecasting models. Therefore, it is pivotal to have a precise evaluation method to gauge the performance and predictability power of these complex methods. To do so, several evaluation metrics have been proposed in the past (such as Energy Score, Dawid-Sebastiani score, variogram score), however, they cannot reliably measure the performance of a probabilistic forecaster. Recently, CRPS-sum has gained a lot of prominence as a reliable metric for multivariate probabilistic forecasting. This paper presents a systematic evaluation of CRPS-sum to understand its discrimination ability. We show that the statistical properties of target data affect the discrimination ability of CRPS-Sum. Furthermore, we highlight that CRPS-Sum calculation overlooks the performance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Air Quality Monitoring and Forecasting · Stock Market Forecasting Methods
