Reinforcement Learning for Multi-Truck Vehicle Routing Problems

Joshua Levin (1); Randall Correll (1); Takanori Ide (2); Takafumi; Suzuki (3); Saito Takaho (3); Alan Arai (4) ((1) QC Ware Corp Palo Alto; (2); Department of Mathematics; Information Science; Josai University; Tokyo,; (3) AISIN CORPORATION Tokyo; (4) Aisin Technical Center of America San Jose)

arXiv:2211.17078·cs.LG·December 31, 2024·1 cites

Reinforcement Learning for Multi-Truck Vehicle Routing Problems

Joshua Levin (1), Randall Correll (1), Takanori Ide (2), Takafumi, Suzuki (3), Saito Takaho (3), Alan Arai (4) ((1) QC Ware Corp Palo Alto, (2), Department of Mathematics, Information Science, Josai University, Tokyo,, (3) AISIN CORPORATION Tokyo

PDF

Open Access

TL;DR

This paper introduces an advanced deep reinforcement learning approach with encoder-decoder models tailored for complex multi-truck vehicle routing problems involving multi-leg routes, demonstrating improved solutions in real-world supply chain scenarios.

Contribution

The paper develops new extensions to encoder-decoder attention models enabling RL to handle multi-truck, multi-leg VRPs, scalable from small to large supply chain problems.

Findings

01

Our models outperform previous solutions in a real automotive supply chain environment.

02

The approach effectively generalizes from small to large problem instances.

03

Deep RL can be adapted for complex, real-world vehicle routing challenges.

Abstract

Deep reinforcement learning (RL) has been shown to be effective in producing approximate solutions to some vehicle routing problems (VRPs), especially when using policies generated by encoder-decoder attention mechanisms. While these techniques have been quite successful for relatively simple problem instances, there are still under-researched and highly complex VRP variants for which no effective RL method has been demonstrated. In this work we focus on one such VRP variant, which contains multiple trucks and multi-leg routing requirements. In these problems, demand is required to move along sequences of nodes, instead of just from a start node to an end node. With the goal of making deep RL a viable strategy for real-world industrial-scale supply chain logistics, we develop new extensions to existing encoder-decoder attention models which allow them to handle multiple trucks and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Supply Chain and Inventory Management · Reinforcement Learning in Robotics