Making and Evaluating Point Forecasts
Tilmann Gneiting

TL;DR
This paper examines how the choice of scoring functions impacts the evaluation of point forecasts, emphasizing the importance of matching scoring functions to forecast objectives for accurate assessment.
Contribution
It clarifies the conditions under which scoring functions are consistent and elicitable for different statistical functionals, providing guidance for proper forecast evaluation.
Findings
Elicitable functionals include means, quantiles, and ratios of expectations.
Consistent scoring functions are characterized as Bregman functions or generalized piecewise linear functions.
Conditional value-at-risk is not elicitable, highlighting limitations in some financial risk measures.
Abstract
Typically, point forecasting methods are compared and assessed by means of an error measure or scoring function, such as the absolute error or the squared error. The individual scores are then averaged over forecast cases, to result in a summary measure of the predictive performance, such as the mean absolute error or the (root) mean squared error. I demonstrate that this common practice can lead to grossly misguided inferences, unless the scoring function and the forecasting task are carefully matched. Effective point forecasting requires that the scoring function be specified ex ante, or that the forecaster receives a directive in the form of a statistical functional, such as the mean or a quantile of the predictive distribution. If the scoring function is specified ex ante, the forecaster can issue the optimal point forecast, namely, the Bayes rule. If the forecaster receives a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Advanced Statistical Methods and Models · Statistical and numerical algorithms
