Learning Practically Feasible Policies for Online 3D Bin Packing
Hang Zhao, Chenyang Zhu, Xin Xu, Hui Huang, Kai Xu

TL;DR
This paper presents a deep reinforcement learning approach to the online 3D bin packing problem, achieving high packing efficiency and practical feasibility for real-world robotic applications.
Contribution
It introduces a novel stacking tree analysis, decoupled dimension-wise policy learning, and a reward design to improve packing stability and collision avoidance in online 3D bin packing.
Findings
Outperforms state-of-the-art methods significantly
Achieves high packing stability and efficiency
Demonstrates practical usability in real-world scenarios
Abstract
We tackle the Online 3D Bin Packing Problem, a challenging yet practically useful variant of the classical Bin Packing Problem. In this problem, the items are delivered to the agent without informing the full sequence information. Agent must directly pack these items into the target bin stably without changing their arrival order, and no further adjustment is permitted. Online 3D-BPP can be naturally formulated as Markov Decision Process (MDP). We adopt deep reinforcement learning, in particular, the on-policy actor-critic framework, to solve this MDP with constrained action space. To learn a practically feasible packing policy, we propose three critical designs. First, we propose an online analysis of packing stability based on a novel stacking tree. It attains a high analysis accuracy while reducing the computational complexity from to , making it especially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Packing Problems · Advanced Manufacturing and Logistics Optimization · Robotic Path Planning Algorithms
