Scaling-aware rating of Poisson-limited demand forecasts
Malte C. Tichy, Illia Babounikau, Nikolas Wolke, Stefan Ulbrich, Michael Feindt

TL;DR
This paper introduces a scaling-aware rating method for demand forecasts that accounts for Poisson noise and rate-dependent errors, enabling fairer comparisons across products and industries.
Contribution
It proposes a stratification-based evaluation approach that benchmarks forecast metrics against theoretical Poisson limits, providing intuitive and comparable forecast quality ratings.
Findings
Effective in distinguishing unavoidable Poisson noise from model errors.
Applied to retail datasets, it yields clear, interpretable forecast quality assessments.
Facilitates fair comparison of models across different product categories.
Abstract
Forecast quality should be assessed in the context of what is possible in theory and what is reasonable to expect in practice. Often, one can identify an approximate upper bound to a probabilistic forecast's sharpness, which sets a lower, not necessarily achievable, limit to error metrics. In retail forecasting, a simple, but often unconquerable sharpness limit is given by the Poisson distribution. When evaluating forecasts using traditional metrics such as Mean Absolute Error, it is hard to judge whether a certain achieved value reflects unavoidable Poisson noise or truly indicates an over-dispersed prediction model. Moreover, every evaluation metric suffers from precision scaling: The metric's value is mostly defined by the selling rate and by the resulting rate-dependent Poisson noise, and only secondarily by the forecast quality. Comparing two groups of forecasted products often…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications
