Omni-scale Learning-based Sequential Decision Framework for Order Fulfillment of Tote-handling Robotic Systems
Jiaxin Liu, Peng Yang, Yuping Li, Xinyue Xie

TL;DR
This paper introduces a scalable, learning-based decision framework for tote-handling robots that improves efficiency and generalizes across different system configurations in logistics operations.
Contribution
It presents a novel, unified framework combining optimization and reinforcement learning for order fulfillment in tote-handling robotic systems, outperforming existing heuristics and rule-based methods.
Findings
Achieves near-optimal performance with less than 3.5% optimality gap on small systems.
Reduces tote movements by 8-12% in large-scale scenarios.
Outperforms state-of-the-art rule-based approaches while maintaining real-time responsiveness.
Abstract
Driven by the rapid expansion of e-commerce and small-batch production, the size of the intralogistics load unit of finished goods, semi-finished goods and raw materials is steadily shrinking. Totes are gradually replacing pallets as the primary handling and storage container. This shift has propelled tote-handling robotic systems to the forefront of automation order fulfillment centers. The order-fulfillment decisions of tote-handling robotic systems share a common order-tote-robot sequential decision-making nature. Existing studies primarily focus on decision mechanisms tailored to particular systems, making it difficult to generalize or transfer them to other contexts. We propose an Omni-scale Learning-based Sequential Decision Framework for Order Fulfillment of Tote-handling Robotic Systems (OLSF-TRS), a generalized and scalable sequential decision framework that combines structured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
