Data-driven inventory management for new products: An adjusted Dyna-$Q$ approach with transfer learning
Xinye Qu, Longxiao Liu, Wenjie Huang

TL;DR
This paper introduces an adjusted Dyna-Q reinforcement learning algorithm enhanced with transfer learning for inventory management of new products, achieving faster training and lower costs compared to existing methods.
Contribution
It develops a novel transfer learning-augmented Dyna-Q algorithm tailored for new product inventory management, improving training speed and cost efficiency.
Findings
Up to 23.7% reduction in average daily cost
Up to 77.5% reduction in training time
Lowest total cost and variance among benchmarks
Abstract
In this paper, we propose a novel reinforcement learning algorithm for inventory management of newly launched products with no historical demand information. The algorithm follows the classic Dyna- structure, balancing the model-free and model-based approaches, while accelerating the training process of Dyna- and mitigating the model discrepancy generated by the model-based feedback. Based on the idea of transfer learning, warm-start information from the demand data of existing similar products can be incorporated into the algorithm to further stabilize the early-stage training and reduce the variance of the estimated optimal policy. Our approach is validated through a case study of bakery inventory management with real data. The adjusted Dyna- shows up to a 23.7\% reduction in average daily cost compared with -learning, and up to a 77.5\% reduction in training time within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence
