Structure-Informed Deep Reinforcement Learning for Inventory Management

Alvaro Maggiar; Sohrab Andaz; Akhil Bagaria; Carson Eisenach; Dean Foster; Omer Gottesman; Dominique Perrault-Joncas

arXiv:2507.22040·cs.LG·July 30, 2025

Structure-Informed Deep Reinforcement Learning for Inventory Management

Alvaro Maggiar, Sohrab Andaz, Akhil Bagaria, Carson Eisenach, Dean Foster, Omer Gottesman, Dominique Perrault-Joncas

PDF

TL;DR

This paper applies deep reinforcement learning to various inventory management problems, demonstrating competitive performance, interpretability, and the integration of structural insights for practical, data-driven decision-making.

Contribution

It introduces a structure-informed DRL approach that incorporates analytical policy characteristics, enhancing interpretability and robustness in inventory management applications.

Findings

01

DRL outperforms traditional heuristics across multiple scenarios.

02

The approach captures known optimal policy structures.

03

Incorporating analytical insights improves interpretability and robustness.

Abstract

This paper investigates the application of Deep Reinforcement Learning (DRL) to classical inventory management problems, with a focus on practical implementation considerations. We apply a DRL algorithm based on DirectBackprop to several fundamental inventory management scenarios including multi-period systems with lost sales (with and without lead times), perishable inventory management, dual sourcing, and joint inventory procurement and removal. The DRL approach learns policies across products using only historical information that would be available in practice, avoiding unrealistic assumptions about demand distributions or access to distribution parameters. We demonstrate that our generic DRL implementation performs competitively against or outperforms established benchmarks and heuristics across these diverse settings, while requiring minimal parameter tuning. Through examination…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.