InvEvolve: Evolving White-Box Inventory Policies via Large Language Models with Performance Guarantees
Chenyu Huang, Jianghao Lin, Zhengyang Tang, Bo Jiang, Ruoqing Jiang, Benyou Wang, Lai Wei

TL;DR
InvEvolve leverages large language models to evolve inventory policies with statistical safety guarantees, outperforming classical and deep learning methods in dynamic retail environments.
Contribution
We introduce InvEvolve, a novel framework that uses LLMs with confidence intervals to generate safe, effective inventory policies in non-stationary settings.
Findings
InvEvolve outperforms classical inventory policies.
It improves upon existing benchmarks in real-world data.
Provides statistical safety guarantees for policy deployment.
Abstract
We study how large language models can be used to evolve inventory policies in online, non-stationary environments. Our work is motivated by recent advances in LLM-based evolutionary search, such as AlphaEvolve, which demonstrates strong performance for static and highly structured problems such as mathematical discovery, but is not directly suited to online dynamic inventory settings. To this end, we propose InvEvolve, an end-to-end inventory policy evolution and inference framework grounded in confidence-interval-based certification. Built on a large language model trained via reinforcement learning, InvEvolve can process demand data together with additional numerical and textual features, and generates white-box inventory policies with statistical safety guarantees for deployment in future periods. We further introduce a unified theoretical model that connects training, inference,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
