Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework
Yulong Hu, Tingting Dong, and Sen Li

TL;DR
This paper presents RG-CQL, a reinforcement learning framework that improves coordination between ride-pooling and public transit through offline training and online fine-tuning, achieving higher rewards and data efficiency in real-world scenarios.
Contribution
The paper introduces a novel RL framework combining Conservative Double Deep Q Networks and a supervised reward estimator for multimodal transit coordination.
Findings
Achieves 17% and 22% higher rewards compared to benchmarks.
81.3% improvement in data efficiency over traditional RL methods.
Effectively transitions from offline to online RL in large-scale systems.
Abstract
This paper introduces a novel reinforcement learning (RL) framework, termed Reward-Guided Conservative Q-learning (RG-CQL), to enhance coordination between ride-pooling and public transit within a multimodal transportation network. We model each ride-pooling vehicle as an agent governed by a Markov Decision Process (MDP) and propose an offline training and online fine-tuning RL framework to learn the optimal operational decisions of the multimodal transportation systems, including rider-vehicle matching, selection of drop-off locations for passengers, and vehicle routing decisions, with improved data efficiency. During the offline training phase, we develop a Conservative Double Deep Q Network (CDDQN) as the action executor and a supervised learning-based reward estimator, termed the Guider Network, to extract valuable insights into action-reward relationships from data batches. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Transportation Planning and Optimization · Smart Parking Systems Research
MethodsQ-Learning
