Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method
Haoyuan Hu, Xiaodong Zhang, Xiaowei Yan, Longfei Wang, Yinghui Xu

TL;DR
This paper introduces a new 3D bin packing problem whose objective is to minimize the surface area of the bin. The authors optimize the item packing sequence with deep reinforcement learning, specifically a Pointer Network, and report results about 5% better than heuristics.
Contribution
The paper proposes a new 3D bin packing problem emphasizing surface area minimization and applies a DRL-based sequence optimization method, demonstrating improved performance over heuristics.
Findings
Achieved approximately 5% improvement over heuristic methods.
Identified the item packing sequence as a key factor in minimizing surface area.
Validated the effectiveness of DRL in complex packing problems.
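To make the role of the sequence concrete, here is a minimal Python sketch. It does not reproduce the paper's heuristic or its DRL method. It assumes a toy greedy rule invented for illustration: each item, in a fixed orientation, is attached to the current bounding box along whichever axis gives the smallest resulting surface area. The names `surface_area`, `greedy_pack`, and `best_order` are hypothetical.

```python
from itertools import permutations


def surface_area(length, width, height):
    """Surface area of a cuboid with the given edge lengths."""
    return 2 * (length * width + width * height + length * height)


def greedy_pack(items):
    """Toy rule (an assumption, not the paper's heuristic).

    Items are (length, width, height) tuples in a fixed orientation.
    Each item is attached to the current bounding box along the axis
    that yields the smallest surface area; the other two box
    dimensions grow only if the item is larger than the box.
    Returns the surface area of the final bounding box.
    """
    box = (0, 0, 0)
    for item in items:
        candidates = []
        for axis in range(3):
            new_box = [max(b, d) for b, d in zip(box, item)]
            new_box[axis] = box[axis] + item[axis]
            candidates.append(tuple(new_box))
        box = min(candidates, key=lambda b: surface_area(*b))
    return surface_area(*box)


def best_order(items):
    """Brute-force search over all sequences (feasible only for tiny inputs)."""
    return min(permutations(items), key=greedy_pack)


if __name__ == "__main__":
    order_a = [(2, 2, 2), (2, 2, 1), (1, 1, 1)]
    order_b = [(1, 1, 1), (2, 2, 2), (2, 2, 1)]
    print(greedy_pack(order_a), greedy_pack(order_b))
```

Under this toy rule the same three items give a surface area of 40 in the first order and 42 in the second, so the sequence alone changes the outcome. The brute-force search grows factorially with the number of items, which is one reason a learned sequence policy is attractive.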
Abstract
In this paper, a new type of 3D bin packing problem (BPP) is proposed, in which a number of cuboid-shaped items must be put into a bin one by one orthogonally. The objective is to find a way to place these items that can minimize the surface area of the bin. This problem is based on the fact that there is no fixed-sized bin in many real business scenarios and the cost of a bin is proportional to its surface area. Our research shows that this problem is NP-hard. Based on previous research on 3D BPP, the surface area is determined by the sequence, spatial locations and orientations of items. Among these factors, the sequence of items plays a key role in minimizing the surface area. Inspired by recent achievements of deep reinforcement learning (DRL) techniques, especially Pointer Network, on combinatorial optimization problems such as TSP, a DRL-based method is applied to optimize the…
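The abstract names the Pointer Network as the inspiration for the sequence optimizer. As a rough sketch of the general mechanism (the standard additive pointer attention, not the authors' exact architecture or training setup), the Python below scores every candidate item against the current decoder state, masks items that are already packed, and applies a softmax to obtain a distribution over the next item. All names (`pointer_distribution`, `w_enc`, `w_dec`, `v`, `mask`) and the tiny hand-set weights are assumptions made for illustration. In practice the states would come from an LSTM encoder and decoder and the weights would be learned.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def pointer_distribution(encoder_states, decoder_state, w_enc, w_dec, v, mask=None):
    """Additive pointer attention: u_j = v . tanh(W_enc e_j + W_dec d).

    mask[j] == True marks item j as already chosen; its score is set
    to -inf so its probability is exactly zero. At least one item
    must remain unmasked.
    """
    projected_dec = matvec(w_dec, decoder_state)
    scores = []
    for j, enc in enumerate(encoder_states):
        projected_enc = matvec(w_enc, enc)
        hidden = [math.tanh(a + b) for a, b in zip(projected_enc, projected_dec)]
        score = sum(vi * hi for vi, hi in zip(v, hidden))
        if mask is not None and mask[j]:
            score = -math.inf
        scores.append(score)
    return softmax(scores)


if __name__ == "__main__":
    identity = [[1.0, 0.0], [0.0, 1.0]]
    states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # one vector per item
    probs = pointer_distribution(
        states, [0.5, -0.5], identity, identity, [1.0, 1.0],
        mask=[True, False, False],
    )
    print(probs)  # item 0 is masked, so its probability is 0.0
```

Sampling or taking the argmax of this distribution step by step, and masking each chosen item, yields a full packing sequence.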
Taxonomy
Topics: Optimization and Packing Problems · Advanced Manufacturing and Logistics Optimization · Manufacturing Process and Optimization
Methods: Sigmoid Activation · Tanh Activation · Softmax · Long Short-Term Memory · Pointer Network
