BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch

Yulong Hu; Siyuan Feng; and Sen Li

arXiv:2501.13448·cs.MA·February 3, 2026

BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch

Yulong Hu, Siyuan Feng, and Sen Li

PDF

Open Access

TL;DR

BMG-Q is a novel multi-agent reinforcement learning framework that uses localized bipartite graph attention and ILP optimization to improve ride-pooling order dispatch, achieving higher rewards and reduced bias.

Contribution

It introduces a new MARL algorithm with graph attention and ILP optimization for ride-pooling, enhancing scalability, robustness, and decision quality.

Findings

01

Outperforms benchmark RL frameworks by ~10% in rewards.

02

Reduces overestimation bias by over 50%.

03

Maintains robustness across task and fleet variations.

Abstract

This paper introduces Localized Bipartite Match Graph Attention Q-Learning (BMG-Q), a novel Multi-Agent Reinforcement Learning (MARL) algorithm framework tailored for ride-pooling order dispatch. BMG-Q advances ride-pooling decision-making process with the localized bipartite match graph underlying the Markov Decision Process, enabling the development of novel Graph Attention Double Deep Q Network (GATDDQN) as the MARL backbone to capture the dynamic interactions among ride-pooling vehicles in fleet. Our approach enriches the state information for each agent with GATDDQN by leveraging a localized bipartite interdependence graph and enables a centralized global coordinator to optimize order matching and agent behavior using Integer Linear Programming (ILP). Enhanced by gradient clipping and localized graph sampling, our GATDDQN improves scalability and robustness. Furthermore, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsElectric Vehicles and Infrastructure · Elevator Systems and Control

MethodsSoftmax · Attention Is All You Need · Q-Learning · Gradient Clipping