A Decentralised Multi-Agent Reinforcement Learning Approach for the Same-Day Delivery Problem
Elvin Ngu, Leandro Parada, Jose Javier Escribano Macias, Panagiotis, Angeloudis

TL;DR
This paper introduces a decentralised multi-agent reinforcement learning method for same-day delivery routing, demonstrating comparable performance to exact methods at a fraction of the computational cost, especially in larger, dynamic scenarios.
Contribution
It formulates the delivery problem as an MDP and applies a parameter-sharing Deep Q-Network within a multi-agent framework, advancing scalable real-time routing solutions.
Findings
MARL performs similarly to MIP for low order volumes.
MARL underperforms MIP by up to 30% in high order scenarios.
MARL is 65 times faster than MIP, suitable for real-time applications.
Abstract
Same-Day Delivery services are becoming increasingly popular in recent years. These have been usually modelled by previous studies as a certain class of Dynamic Vehicle Routing Problem (DVRP) where goods must be delivered from a depot to a set of customers in the same day that the orders were placed. Adaptive exact solution methods for DVRPs can become intractable even for small problem instances. In this paper, we formulate the SDDP as a Markov Decision Process (MDP) and solve it using a parameter-sharing Deep Q-Network, which corresponds to a decentralised Multi-Agent Reinforcement Learning (MARL) approach. For this, we create a multi-agent grid-based SDD environment, consisting of multiple vehicles, a central depot and dynamic order generation. In addition, we introduce zone-specific order generation and reward probabilities. We compare the performance of our proposed MARL approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Traffic control and management · Transportation Planning and Optimization
