Real-Time Integrated Dispatching and Idle Fleet Steering with Deep   Reinforcement Learning for A Meal Delivery Platform

Jingyi Cheng; Shadi Sharif Azadeh

arXiv:2501.05808·eess.SY·January 14, 2025

Real-Time Integrated Dispatching and Idle Fleet Steering with Deep Reinforcement Learning for A Meal Delivery Platform

Jingyi Cheng, Shadi Sharif Azadeh

PDF

Open Access

TL;DR

This paper presents a deep reinforcement learning framework for real-time dispatching and idle courier steering in meal delivery platforms, improving efficiency and fairness by modeling the problem as Markov Decision Processes and integrating demand prediction.

Contribution

The study introduces a novel RL-based dual-control framework that jointly optimizes dispatching and courier steering with demand prediction, enhancing real-time decision-making in meal delivery.

Findings

01

Improved delivery efficiency and workload fairness.

02

Alleviated under-supply conditions in the service network.

03

Enhanced real-time operational decisions using RL policies.

Abstract

To achieve high service quality and profitability, meal delivery platforms like Uber Eats and Grubhub must strategically operate their fleets to ensure timely deliveries for current orders while mitigating the consequential impacts of suboptimal decisions that leads to courier understaffing in the future. This study set out to solve the real-time order dispatching and idle courier steering problems for a meal delivery platform by proposing a reinforcement learning (RL)-based strategic dual-control framework. To address the inherent sequential nature of these problems, we model both order dispatching and courier steering as Markov Decision Processes. Trained via a deep reinforcement learning (DRL) framework, we obtain strategic policies by leveraging the explicitly predicted demands as part of the inputs. In our dual-control framework, the dispatching and steering policies are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Manufacturing and Logistics Optimization

Methodstravel james · Sparse Evolutionary Training