Real-Time Integrated Dispatching and Idle Fleet Steering with Deep Reinforcement Learning for A Meal Delivery Platform
Jingyi Cheng, Shadi Sharif Azadeh

TL;DR
This paper presents a deep reinforcement learning framework for real-time dispatching and idle courier steering in meal delivery platforms, improving efficiency and fairness by modeling the problem as Markov Decision Processes and integrating demand prediction.
Contribution
The study introduces a novel RL-based dual-control framework that jointly optimizes dispatching and courier steering with demand prediction, enhancing real-time decision-making in meal delivery.
Findings
Improved delivery efficiency and workload fairness.
Alleviated under-supply conditions in the service network.
Enhanced real-time operational decisions using RL policies.
Abstract
To achieve high service quality and profitability, meal delivery platforms like Uber Eats and Grubhub must strategically operate their fleets to ensure timely deliveries for current orders while mitigating the consequential impacts of suboptimal decisions that leads to courier understaffing in the future. This study set out to solve the real-time order dispatching and idle courier steering problems for a meal delivery platform by proposing a reinforcement learning (RL)-based strategic dual-control framework. To address the inherent sequential nature of these problems, we model both order dispatching and courier steering as Markov Decision Processes. Trained via a deep reinforcement learning (DRL) framework, we obtain strategic policies by leveraging the explicitly predicted demands as part of the inputs. In our dual-control framework, the dispatching and steering policies are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Manufacturing and Logistics Optimization
Methodstravel james · Sparse Evolutionary Training
