AI Agents for Inventory Control: Human-LLM-OR Complementarity
Jackie Baek, Yaopeng Fu, Will Ma, Tianyi Peng

TL;DR
This paper explores how operations research algorithms, large language models, and humans can effectively collaborate in inventory control, demonstrating that combined approaches outperform individual methods on a comprehensive benchmark.
Contribution
It introduces InventoryBench, a new benchmark for testing inventory decisions, and shows that hybrid methods and human-AI teams outperform standalone approaches.
Findings
OR-augmented LLMs outperform individual methods.
Human-AI teams achieve higher profits than humans or AI alone.
A substantial fraction of individuals benefit from AI collaboration.
Abstract
Inventory control is a fundamental operations problem in which ordering decisions are traditionally guided by theoretically grounded operations research (OR) algorithms. However, such algorithms often rely on rigid modeling assumptions and can perform poorly when demand distributions shift or relevant contextual information is unavailable. Recent advances in large language models (LLMs) have generated interest in AI agents that can reason flexibly and incorporate rich contextual signals, but it remains unclear how best to incorporate LLM-based methods into traditional decision-making pipelines. We study how OR algorithms, LLMs, and humans can interact and complement each other in a multi-period inventory control setting. We construct InventoryBench, a benchmark of over 1,000 inventory instances spanning both synthetic and real-world demand data, designed to stress-test decision rules…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
