Hierarchical Evaluation Function: A Multi-Metric Approach for Optimizing Demand Forecasting Models
Adolfo Gonz\'alez, V\'ictor Parada

TL;DR
The paper introduces a multi-metric hierarchical evaluation function (HEF) for demand forecasting models, improving model selection and accuracy by integrating multiple evaluation criteria rather than relying on a single metric.
Contribution
It proposes a novel multi-metric evaluation framework (HEF) for hyperparameter optimization, enhancing demand forecasting accuracy and robustness across diverse datasets.
Findings
HEF outperforms unimetric functions across datasets.
HEF improves stability and generalization of models.
HEF is computationally efficient and effective.
Abstract
Demand forecasting in competitive, uncertain business environments requires models that can integrate multiple evaluation perspectives rather than being restricted to hyperparameter optimization based on a single metric. This traditional approach tends to prioritize one error indicator, which can bias results when metrics provide contradictory signals. In this context, the Hierarchical Evaluation Function (HEF) is proposed as a multi-metric framework for hyperparameter optimization that integrates explanatory power (R2), sensitivity to extreme errors (RMSE), and average accuracy (MAE). The performance of HEF was assessed using four widely recognized benchmark datasets in the forecasting domain: Walmart, M3, M4, and M5. Prediction models were optimized through Grid Search, Particle Swarm Optimization (PSO), and Optuna, and statistical analyses based on difference-of-proportions tests…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Energy Load and Power Forecasting · Consumer Market Behavior and Pricing
