Enhancing Retail Sales Forecasting with Optimized Machine Learning Models
Priyam Ganguly, Isha Mukherjee

TL;DR
This paper demonstrates that optimizing Random Forest models with hyperparameter tuning significantly improves retail sales forecasting accuracy over traditional methods and other machine learning models.
Contribution
The study introduces an optimized RF model with hyperparameter tuning that effectively captures complex sales data patterns, outperforming existing models in accuracy.
Findings
Optimized RF achieved R-squared of 0.945, higher than traditional LR.
Model reduced RMSLE to 1.172, indicating better prediction accuracy.
Outperformed GB, SVR, and XGBoost models in forecasting performance.
Abstract
In retail sales forecasting, accurately predicting future sales is crucial for inventory management and strategic planning. Traditional methods like LR often fall short due to the complexity of sales data, which includes seasonality and numerous product families. Recent advancements in machine learning (ML) provide more robust alternatives. This research benefits from the power of ML, particularly Random Forest (RF), Gradient Boosting (GB), Support Vector Regression (SVR), and XGBoost, to improve prediction accuracy. Despite advancements, a significant gap exists in handling complex datasets with high seasonality and multiple product families. The proposed solution involves implementing and optimizing a RF model, leveraging hyperparameter tuning through randomized search cross-validation. This approach addresses the complexities of the dataset, capturing intricate patterns that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Forecasting Techniques and Applications
