AI Agents for Inventory Control: Human-LLM-OR Complementarity

Jackie Baek; Yaopeng Fu; Will Ma; Tianyi Peng

arXiv:2602.12631·cs.AI·May 6, 2026

TL;DR

This paper studies how operations research (OR) algorithms, large language models (LLMs), and humans can complement one another in multi-period inventory control, and shows that combined approaches outperform each method used alone on the InventoryBench benchmark.

Contribution

The paper introduces InventoryBench, a benchmark of over 1,000 inventory instances spanning synthetic and real-world demand data, and shows that hybrid methods and human-AI teams outperform standalone approaches.

Findings

1. OR-augmented LLMs outperform individual methods.
2. Human-AI teams achieve higher profits than humans or AI alone.
3. A substantial fraction of individuals benefit from AI collaboration.

Abstract

Inventory control is a fundamental operations problem in which ordering decisions are traditionally guided by theoretically grounded operations research (OR) algorithms. However, such algorithms often rely on rigid modeling assumptions and can perform poorly when demand distributions shift or relevant contextual information is unavailable. Recent advances in large language models (LLMs) have generated interest in AI agents that can reason flexibly and incorporate rich contextual signals, but it remains unclear how best to incorporate LLM-based methods into traditional decision-making pipelines. We study how OR algorithms, LLMs, and humans can interact and complement each other in a multi-period inventory control setting. We construct InventoryBench, a benchmark of over 1,000 inventory instances spanning both synthetic and real-world demand data, designed to stress-test decision rules…
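To make the "rigid modeling assumptions" of a classical OR rule concrete, here is a minimal sketch of a textbook newsvendor-style base-stock policy in a multi-period setting. It is not the paper's method and is not taken from InventoryBench: the function names, the i.i.d. normal demand assumption, zero lead time, lost sales, and the simplified cost structure (overage cost equal to unit cost) are all assumptions chosen for illustration.

```python
from statistics import NormalDist, mean, stdev


def base_stock_level(history, price, cost):
    """Order-up-to level from the newsvendor critical ratio.

    Assumption (illustrative): demand is i.i.d. normal, with mean and
    standard deviation estimated from past observations.
    """
    # Underage cost is the lost margin; overage cost is taken to be the unit cost.
    critical_ratio = (price - cost) / price
    mu, sigma = mean(history), stdev(history)
    return mu + sigma * NormalDist().inv_cdf(critical_ratio)


def simulate(demands, history, price, cost):
    """Run the policy over several periods and return total profit.

    Assumptions (illustrative): zero lead time, unmet demand is lost,
    and leftover stock carries over to the next period at no extra cost.
    """
    history = list(history)
    on_hand, profit = 0.0, 0.0
    for demand in demands:
        target = base_stock_level(history, price, cost)
        order = max(0.0, target - on_hand)  # never order a negative quantity
        on_hand += order
        sales = min(on_hand, demand)
        profit += price * sales - cost * order
        on_hand -= sales
        history.append(demand)  # the rule only learns from past demand
    return profit
```

Because the target depends only on the historical mean and variance, a rule like this adapts slowly when the demand distribution shifts and cannot use contextual signals that are not in the demand history, which is the weakness the abstract attributes to traditional OR algorithms.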
