A Deep Reinforcement Learning Approach for the Meal Delivery Problem
Hadi Jahanshahi, Aysun Bozanta, Mucahit Cevik, Eray Mert Kavuk,, Ay\c{s}e Tosun, Sibel B. Sonuc, Bilgin Kosucu, Ay\c{s}e Ba\c{s}ar

TL;DR
This paper introduces a deep reinforcement learning model for optimizing meal delivery services, improving service quality and resource utilization by considering geographical data and dynamic customer requests.
Contribution
It presents a novel RL-based approach modeling the delivery problem as a Markov decision process, incorporating geographical info and resource management for the first time.
Findings
Significant improvement in total reward and delivery times.
Effective resource utilization and courier assignment strategies.
Robust performance across real-world scenarios.
Abstract
We consider a meal delivery service fulfilling dynamic customer requests given a set of couriers over the course of a day. A courier's duty is to pick-up an order from a restaurant and deliver it to a customer. We model this service as a Markov decision process and use deep reinforcement learning as the solution approach. We experiment with the resulting policies on synthetic and real-world datasets and compare those with the baseline policies. We also examine the courier utilization for different numbers of couriers. In our analysis, we specifically focus on the impact of the limited available resources in the meal delivery problem. Furthermore, we investigate the effect of intelligent order rejection and re-positioning of the couriers. Our numerical experiments show that, by incorporating the geographical locations of the restaurants, customers, and the depot, our model significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
