Online 3D Bin Packing Reinforcement Learning Solution with Buffer
Aaron Valero Puche, Sukhan Lee

TL;DR
This paper introduces a reinforcement learning framework with a buffer and data augmentation for 3D bin packing, achieving superior space utilization compared to existing methods.
Contribution
It presents a novel RL approach with multi-item action buffer and symmetry-based data augmentation, inspired by AlphaGo, for improved 3D bin packing performance.
Findings
Outperforms state-of-the-art in space utilization
Efficient training with single thread and GPU
Effective use of buffer and data augmentation strategies
Abstract
The 3D Bin Packing Problem (3D-BPP) is one of the most demanded yet challenging problems in industry, where an agent must pack variable size items delivered in sequence into a finite bin with the aim to maximize the space utilization. It represents a strongly NP-Hard optimization problem such that no solution has been offered to date with high performance in space utilization. In this paper, we present a new reinforcement learning (RL) framework for a 3D-BPP solution for improving performance. First, a buffer is introduced to allow multi-item action selection. By increasing the degree of freedom in action selection, a more complex policy that results in better packing performance can be derived. Second, we propose an agnostic data augmentation strategy that exploits both bin item symmetries for improving sample efficiency. Third, we implement a model-based RL method adapted from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAssembly Line Balancing Optimization · Optimization and Packing Problems · Advanced Manufacturing and Logistics Optimization
