In-Context Learning for Data-Driven Censored Inventory Control
Sohom Mukherjee, Anh-Duy Pham, Richard Pibernik, Yunbei Xu

TL;DR
This paper introduces ICGPS, a novel in-context generative posterior sampling method for censored inventory control, combining offline meta-training with online autoregressive generation to improve robustness and performance.
Contribution
It proposes a new generative approach that leverages offline meta-training and in-context generation, providing theoretical regret bounds and practical effectiveness in censored inventory problems.
Findings
ICGPS matches known TS benchmarks with ideal completion kernels.
Outperforms myopic and UCB baselines in benchmark tests.
Demonstrates robustness to prior mismatch and distribution shift.
Abstract
We study inventory control with decision-dependent censoring, focusing on the censored or repeated newsvendor (R-NV), where each order quantity determines whether demand is fully observed or censored by sales. Existing approaches based on parametric Thompson sampling (TS) can be brittle under prior mismatch, while offline imputation methods need not transfer to online learning. Motivated by the predictive view of decision making, we combine these ideas by taking oracle actions on learned completions of latent demand. We propose in-context generative posterior sampling (ICGPS), which uses modern generative models that are meta-trained offline and deployed online by in-context autoregressive generation. Theoretically, we show that the Bayesian regret of ICGPS with a learned completion kernel is bounded by the Bayesian regret of a TS benchmark with the ideal completion kernel plus a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
