Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment
Yi Zheng, Zehao Li, Peng Jiang, Yijie Peng

TL;DR
This paper introduces a dual-agent deep reinforcement learning framework for dynamic pricing and inventory replenishment, effectively handling decision frequency discrepancies and demand complexities to optimize profits.
Contribution
It presents a novel dual-agent DRL algorithm with a two-timescale scheme for joint pricing and replenishment under complex demand models.
Findings
The concavity of the profit function is established.
The integrated machine learning demand model improves accuracy.
Numerical results show the method's effectiveness in various scenarios.
Abstract
We study the dynamic pricing and replenishment problems under inconsistent decision frequencies. Different from the traditional demand assumption, the discreteness of demand and the parameter within the Poisson distribution as a function of price introduce complexity into analyzing the problem property. We demonstrate the concavity of the single-period profit function with respect to product price and inventory within their respective domains. The demand model is enhanced by integrating a decision tree-based machine learning approach, trained on comprehensive market data. Employing a two-timescale stochastic approximation scheme, we address the discrepancies in decision frequencies between pricing and replenishment, ensuring convergence to local optimum. We further refine our methodology by incorporating deep reinforcement learning (DRL) techniques and propose a fast-slow dual-agent DRL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Consumer Retail Behavior Studies
