DRL4Route: A Deep Reinforcement Learning Framework for Pick-up and Delivery Route Prediction
Xiaowei Mao, Haomin Wen, Hengrui Zhang, Huaiyu Wan, Lixia Wu, Jianbin, Zheng, Haoyuan Hu, Youfang Lin

TL;DR
This paper introduces DRL4Route, a reinforcement learning framework for pick-up and delivery route prediction, which improves accuracy by aligning training with practical test criteria, outperforming existing deep learning models.
Contribution
It is the first to apply reinforcement learning to route prediction, integrating non-differentiable objective optimization with deep learning for better practical performance.
Findings
Improves Location Square Deviation (LSD) by 0.9%-2.7%.
Enhances Accuracy@3 (ACC@3) by 2.4%-3.2%.
Demonstrates effectiveness through offline and online experiments.
Abstract
Pick-up and Delivery Route Prediction (PDRP), which aims to estimate the future service route of a worker given his current task pool, has received rising attention in recent years. Deep neural networks based on supervised learning have emerged as the dominant model for the task because of their powerful ability to capture workers' behavior patterns from massive historical data. Though promising, they fail to introduce the non-differentiable test criteria into the training process, leading to a mismatch in training and test criteria. Which considerably trims down their performance when applied in practical systems. To tackle the above issue, we present the first attempt to generalize Reinforcement Learning (RL) to the route prediction task, leading to a novel RL-based framework called DRL4Route. It combines the behavior-learning abilities of previous deep learning models with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Human Mobility and Location-Based Analysis · Urban and Freight Transport Logistics
Methodstravel james · fail
