Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding
Ayan Maity, Sudeshna Sarkar

TL;DR
This paper introduces a deep reinforcement learning approach with an improved network embedding for vehicle routing within a finite time horizon, enhancing service rate and reducing solution time.
Contribution
It proposes a novel network embedding module that incorporates finite time horizon context into deep reinforcement learning for vehicle routing.
Findings
Higher customer service rate compared to existing methods
Significantly lower solution time
Effective on real-world and synthetic networks
Abstract
In this paper, we study the vehicle routing problem with a finite time horizon. In this routing problem, the objective is to maximize the number of customer requests served within a finite time horizon. We present a novel routing network embedding module which creates local node embedding vectors and a context-aware global graph representation. The proposed Markov decision process for the vehicle routing problem incorporates the node features, the network adjacency matrix and the edge features as components of the state space. We incorporate the remaining finite time horizon into the network embedding module to provide a proper routing context to the embedding module. We integrate our embedding module with a policy gradient-based deep Reinforcement Learning framework to solve the vehicle routing problem with finite time horizon. We trained and validated our proposed routing method on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsVehicle Routing Optimization Methods · Software-Defined Networks and 5G · Transportation and Mobility Innovations
