Effective Online 3D Bin Packing with Lookahead Parcels Using Monte Carlo Tree Search
Jiangyi Fang, Bowen Zhou, Haotian Wang, Xin Zhu, Leye Wang

TL;DR
This paper introduces a novel Monte Carlo Tree Search-based framework for online 3D bin packing that leverages lookahead information to adapt to distribution shifts, significantly improving efficiency and robustness in logistics applications.
Contribution
It formulates online 3D bin packing with lookahead parcels as a Model Predictive Control problem and develops a MCTS framework with a dynamic exploration prior and auxiliary reward.
Findings
Achieves over 10% performance gains under distributional shifts
Provides 4% average improvement in online deployment
Demonstrates up to 8% best-case improvement
Abstract
Online 3D Bin Packing (3D-BP) with robotic arms is crucial for reducing transportation and labor costs in modern logistics. While Deep Reinforcement Learning (DRL) has shown strong performance, it often fails to adapt to real-world short-term distribution shifts, which arise as different batches of goods arrive sequentially, causing performance drops. We argue that the short-term lookahead information available in modern logistics systems is key to mitigating this issue, especially during distribution shifts. We formulate online 3D-BP with lookahead parcels as a Model Predictive Control (MPC) problem and adapt the Monte Carlo Tree Search (MCTS) framework to solve it. Our framework employs a dynamic exploration prior that automatically balances a learned RL policy and a robust random policy based on the lookahead characteristics. Additionally, we design an auxiliary reward to penalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Packing Problems · Digital Transformation in Industry · Vehicle Routing Optimization Methods
