Deep Q-Learning for Same-Day Delivery with Vehicles and Drones
Xinwei Chen, Marlin W. Ulmer, Barrett W. Thomas

TL;DR
This paper introduces a deep Q-learning approach for optimizing same-day delivery using vehicles and drones, effectively managing dynamic requests and fleet heterogeneity in urban environments.
Contribution
It presents a novel deep reinforcement learning method that dynamically assigns deliveries to vehicles or drones, outperforming benchmark policies in complex delivery scenarios.
Findings
Deep Q-learning outperforms benchmark policies in delivery efficiency.
The policy maintains effectiveness with moderate fleet size changes.
Trained policies adapt well to varied spatial and temporal data distributions.
Abstract
In this paper, we consider same-day delivery with vehicles and drones. Customers make delivery requests over the course of the day, and the dispatcher dynamically dispatches vehicles and drones to deliver the goods to customers before their delivery deadline. Vehicles can deliver multiple packages in one route but travel relatively slowly due to the urban traffic. Drones travel faster, but they have limited capacity and require charging or battery swaps. To exploit the different strengths of the fleets, we propose a deep Q-learning approach. Our method learns the value of assigning a new customer to either drones or vehicles as well as the option to not offer service at all. In a systematic computational analysis, we show the superiority of our policy compared to benchmark policies and the effectiveness of our deep Q-learning approach. We also show that our policy can maintain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsQ-Learning
