Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm   Deployed in Ridehailing Marketplace

Soheil Sadeghi Eshkevari; Xiaocheng Tang; Zhiwei Qin; Jinhan Mei,; Cheng Zhang; Qianying Meng; Jia Xu

arXiv:2202.05118·cs.LG·February 11, 2022·5 cites

Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace

Soheil Sadeghi Eshkevari, Xiaocheng Tang, Zhiwei Qin, Jinhan Mei,, Cheng Zhang, Qianying Meng, Jia Xu

PDF

Open Access

TL;DR

This paper presents a scalable reinforcement learning-based dispatching algorithm for ridehailing platforms, demonstrating significant performance improvements and successful large-scale deployment in real-world markets.

Contribution

It introduces a novel RL dispatching solution with enhanced value updating, utility functions, and adaptive graph pruning, enabling scalable and robust deployment in ridehailing.

Findings

01

Over 1.3% increase in driver income in online deployment

02

Up to 5.3% improvement in key performance metrics

03

Successful large-scale deployment in multiple cities

Abstract

In this study, a real-time dispatching algorithm based on reinforcement learning is proposed and for the first time, is deployed in large scale. Current dispatching methods in ridehailing platforms are dominantly based on myopic or rule-based non-myopic approaches. Reinforcement learning enables dispatching policies that are informed of historical data and able to employ the learned information to optimize returns of expected future trajectories. Previous studies in this field yielded promising results, yet have left room for further improvements in terms of performance gain, self-dependency, transferability, and scalable deployment mechanisms. The present study proposes a standalone RL-based dispatching solution that is equipped with multiple mechanisms to ensure robust and efficient on-policy learning and inference while being adaptable for full-scale deployment. A new form of value…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTransportation and Mobility Innovations · Transportation Planning and Optimization · Electric Vehicles and Infrastructure

MethodsPruning