Evolutionary RL for Container Loading

S Saikia; R Verma; P Agarwal; G Shroff; L Vig; A Srinivasan

arXiv:1805.06664·cs.AI·May 18, 2018·1 cites

Evolutionary RL for Container Loading

S Saikia, R Verma, P Agarwal, G Shroff, L Vig, A Srinivasan

PDF

Open Access

TL;DR

This paper introduces a hybrid evolutionary and reinforcement learning approach to optimize container loading sequences, outperforming heuristics and generalizing well to unseen problems in port operations.

Contribution

It presents a novel combination of Evolutionary Strategies and Policy Gradient RL to approximate optimal container loading sequences, improving over traditional heuristics.

Findings

01

RL agent learns near-optimal solutions

02

Outperforms heuristic solutions in experiments

03

Generalizes better with a solution pool

Abstract

Loading the containers on the ship from a yard, is an impor- tant part of port operations. Finding the optimal sequence for the loading of containers, is known to be computationally hard and is an example of combinatorial optimization, which leads to the application of simple heuristics in practice. In this paper, we propose an approach which uses a mix of Evolutionary Strategies and Reinforcement Learning (RL) tech- niques to find an approximation of the optimal solution. The RL based agent uses the Policy Gradient method, an evolutionary reward strategy and a Pool of good (not-optimal) solutions to find the approximation. We find that the RL agent learns near-optimal solutions that outperforms the heuristic solutions. We also observe that the RL agent assisted with a pool generalizes better for unseen problems than an RL agent without a pool. We present our results on synthetic data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMetaheuristic Optimization Algorithms Research · Maritime Ports and Logistics · Optimization and Packing Problems