Online 3D Bin Packing Reinforcement Learning Solution with Buffer

Aaron Valero Puche; Sukhan Lee

arXiv:2208.07123·cs.RO·August 16, 2022·1 cites

Online 3D Bin Packing Reinforcement Learning Solution with Buffer

Aaron Valero Puche, Sukhan Lee

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning framework with a buffer and data augmentation for 3D bin packing, achieving superior space utilization compared to existing methods.

Contribution

It presents a novel RL approach with multi-item action buffer and symmetry-based data augmentation, inspired by AlphaGo, for improved 3D bin packing performance.

Findings

01

Outperforms state-of-the-art in space utilization

02

Efficient training with single thread and GPU

03

Effective use of buffer and data augmentation strategies

Abstract

The 3D Bin Packing Problem (3D-BPP) is one of the most demanded yet challenging problems in industry, where an agent must pack variable size items delivered in sequence into a finite bin with the aim to maximize the space utilization. It represents a strongly NP-Hard optimization problem such that no solution has been offered to date with high performance in space utilization. In this paper, we present a new reinforcement learning (RL) framework for a 3D-BPP solution for improving performance. First, a buffer is introduced to allow multi-item action selection. By increasing the degree of freedom in action selection, a more complex policy that results in better packing performance can be derived. Second, we propose an agnostic data augmentation strategy that exploits both bin item symmetries for improving sample efficiency. Third, we implement a model-based RL method adapted from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAssembly Line Balancing Optimization · Optimization and Packing Problems · Advanced Manufacturing and Logistics Optimization