Mitigating Dimensionality in 2D Rectangle Packing Problem under   Reinforcement Learning Schema

Waldemar Ko{\l}odziejczyk; Mariusz Kaleta

arXiv:2409.09677·cs.LG·September 25, 2024

Mitigating Dimensionality in 2D Rectangle Packing Problem under Reinforcement Learning Schema

Waldemar Ko{\l}odziejczyk, Mariusz Kaleta

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach with reduced state-action representations for 2D rectangle packing, achieving comparable results to heuristics and showing potential for broader applications.

Contribution

It presents a novel RL-based method using UNet and PPO with reduced representations, improving generalizability to complex packing problems.

Findings

01

Achieved packing performance comparable to MaxRect heuristic.

02

Demonstrated potential for generalization to nonrectangular packing.

03

Proposed a scalable RL framework with reduced state-action spaces.

Abstract

This paper explores the application of Reinforcement Learning (RL) to the two-dimensional rectangular packing problem. We propose a reduced representation of the state and action spaces that allow us for high granularity. Leveraging UNet architecture and Proximal Policy Optimization (PPO), we achieved a model that is comparable to the MaxRect heuristic. However, our approach has great potential to be generalized to nonrectangular packing problems and complex constraints.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReal-Time Systems Scheduling · Space Satellite Systems and Control · Distributed and Parallel Computing Systems