Mutual Information as Intrinsic Reward of Reinforcement Learning Agents for On-demand Ride Pooling
Xianjie Zhang, Jiahao Sun, Chen Gong, Kai Wang, Yifei Cao, Hao Chen,, Hao Chen, Yu Liu

TL;DR
This paper introduces a reinforcement learning framework for vehicle dispatching in ride pooling services, utilizing mutual information as an intrinsic reward to improve matching efficiency and increase revenue.
Contribution
It proposes a novel RL-based dispatching method that incorporates mutual information as an intrinsic reward to better match vehicle and request distributions.
Findings
Revenue increased by up to 3% over existing methods
Mutual information improves request-vehicle distribution correlation
Framework effectively handles unusual request distributions
Abstract
The emergence of on-demand ride pooling services allows each vehicle to serve multiple passengers at a time, thus increasing drivers' income and enabling passengers to travel at lower prices than taxi/car on-demand services (only one passenger can be assigned to a car at a time like UberX and Lyft). Although on-demand ride pooling services can bring so many benefits, ride pooling services need a well-defined matching strategy to maximize the benefits for all parties (passengers, drivers, aggregation companies and environment), in which the regional dispatching of vehicles has a significant impact on the matching and revenue. Existing algorithms often only consider revenue maximization, which makes it difficult for requests with unusual distribution to get a ride. How to increase revenue while ensuring a reasonable assignment of requests brings a challenge to ride pooling service…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Sharing Economy and Platforms · Transportation Planning and Optimization
Methodstravel james · Emirates Airlines Office in Dubai
