Structure-Informed Deep Reinforcement Learning for Inventory Management
Alvaro Maggiar, Sohrab Andaz, Akhil Bagaria, Carson Eisenach, Dean Foster, Omer Gottesman, Dominique Perrault-Joncas

TL;DR
This paper applies deep reinforcement learning to various inventory management problems, demonstrating competitive performance, interpretability, and the integration of structural insights for practical, data-driven decision-making.
Contribution
It introduces a structure-informed DRL approach that incorporates analytical policy characteristics, enhancing interpretability and robustness in inventory management applications.
Findings
DRL outperforms traditional heuristics across multiple scenarios.
The approach captures known optimal policy structures.
Incorporating analytical insights improves interpretability and robustness.
Abstract
This paper investigates the application of Deep Reinforcement Learning (DRL) to classical inventory management problems, with a focus on practical implementation considerations. We apply a DRL algorithm based on DirectBackprop to several fundamental inventory management scenarios including multi-period systems with lost sales (with and without lead times), perishable inventory management, dual sourcing, and joint inventory procurement and removal. The DRL approach learns policies across products using only historical information that would be available in practice, avoiding unrealistic assumptions about demand distributions or access to distribution parameters. We demonstrate that our generic DRL implementation performs competitively against or outperforms established benchmarks and heuristics across these diverse settings, while requiring minimal parameter tuning. Through examination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
