Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy
Michael N. Katehakis, Jian Yang, and Tingting Zhou

TL;DR
This paper analyzes an adaptive newsvendor-based inventory policy for discrete nonperishable items with unknown demand, providing regret bounds and simulation results to evaluate its effectiveness over multiple periods.
Contribution
It introduces a regret analysis for an adaptive policy that learns demand distribution over time, offering near-optimal bounds without prior demand information.
Findings
Regret remains bounded under demand separation guarantees.
Without prior info, regret grows at most as T^{1/2+ε}.
Simulation compares policy performance with alternatives.
Abstract
Inventory control with unknown demand distribution is considered, with emphasis placed on the case involving discrete nonperishable items. We focus on an adaptive policy which in every period uses, as much as possible, the optimal newsvendor ordering quantity for the empirical distribution learned up to that period. The policy is assessed using the regret criterion, which measures the price paid for ambiguity on demand distribution over periods. When there are guarantees on the latter's separation from the critical newsvendor parameter , a constant upper bound on regret can be found. Without any prior information on the demand distribution, we show that the regret does not grow faster than the rate for any . In view of a known lower bound, this is almost the best one could hope for. Simulation studies involving this along with other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Advanced Queuing Theory Analysis · Advanced Bandit Algorithms Research
