Coordinating Ride-Pooling with Public Transit using Reward-Guided   Conservative Q-Learning: An Offline Training and Online Fine-Tuning   Reinforcement Learning Framework

Yulong Hu; Tingting Dong; and Sen Li

arXiv:2501.14199·cs.LG·January 27, 2025

Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework

Yulong Hu, Tingting Dong, and Sen Li

PDF

Open Access

TL;DR

This paper presents RG-CQL, a reinforcement learning framework that improves coordination between ride-pooling and public transit through offline training and online fine-tuning, achieving higher rewards and data efficiency in real-world scenarios.

Contribution

The paper introduces a novel RL framework combining Conservative Double Deep Q Networks and a supervised reward estimator for multimodal transit coordination.

Findings

01

Achieves 17% and 22% higher rewards compared to benchmarks.

02

81.3% improvement in data efficiency over traditional RL methods.

03

Effectively transitions from offline to online RL in large-scale systems.

Abstract

This paper introduces a novel reinforcement learning (RL) framework, termed Reward-Guided Conservative Q-learning (RG-CQL), to enhance coordination between ride-pooling and public transit within a multimodal transportation network. We model each ride-pooling vehicle as an agent governed by a Markov Decision Process (MDP) and propose an offline training and online fine-tuning RL framework to learn the optimal operational decisions of the multimodal transportation systems, including rider-vehicle matching, selection of drop-off locations for passengers, and vehicle routing decisions, with improved data efficiency. During the offline training phase, we develop a Conservative Double Deep Q Network (CDDQN) as the action executor and a supervised learning-based reward estimator, termed the Guider Network, to extract valuable insights into action-reward relationships from data batches. In the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTransportation and Mobility Innovations · Transportation Planning and Optimization · Smart Parking Systems Research

MethodsQ-Learning