Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace
Soheil Sadeghi Eshkevari, Xiaocheng Tang, Zhiwei Qin, Jinhan Mei,, Cheng Zhang, Qianying Meng, Jia Xu

TL;DR
This paper presents a scalable reinforcement learning-based dispatching algorithm for ridehailing platforms, demonstrating significant performance improvements and successful large-scale deployment in real-world markets.
Contribution
It introduces a novel RL dispatching solution with enhanced value updating, utility functions, and adaptive graph pruning, enabling scalable and robust deployment in ridehailing.
Findings
Over 1.3% increase in driver income in online deployment
Up to 5.3% improvement in key performance metrics
Successful large-scale deployment in multiple cities
Abstract
In this study, a real-time dispatching algorithm based on reinforcement learning is proposed and for the first time, is deployed in large scale. Current dispatching methods in ridehailing platforms are dominantly based on myopic or rule-based non-myopic approaches. Reinforcement learning enables dispatching policies that are informed of historical data and able to employ the learned information to optimize returns of expected future trajectories. Previous studies in this field yielded promising results, yet have left room for further improvements in terms of performance gain, self-dependency, transferability, and scalable deployment mechanisms. The present study proposes a standalone RL-based dispatching solution that is equipped with multiple mechanisms to ensure robust and efficient on-policy learning and inference while being adaptable for full-scale deployment. A new form of value…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Transportation Planning and Optimization · Electric Vehicles and Infrastructure
MethodsPruning
