BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch
Yulong Hu, Siyuan Feng, and Sen Li

TL;DR
BMG-Q is a novel multi-agent reinforcement learning framework that uses localized bipartite graph attention and ILP optimization to improve ride-pooling order dispatch, achieving higher rewards and reduced bias.
Contribution
It introduces a new MARL algorithm with graph attention and ILP optimization for ride-pooling, enhancing scalability, robustness, and decision quality.
Findings
Outperforms benchmark RL frameworks by ~10% in rewards.
Reduces overestimation bias by over 50%.
Maintains robustness across task and fleet variations.
Abstract
This paper introduces Localized Bipartite Match Graph Attention Q-Learning (BMG-Q), a novel Multi-Agent Reinforcement Learning (MARL) algorithm framework tailored for ride-pooling order dispatch. BMG-Q advances ride-pooling decision-making process with the localized bipartite match graph underlying the Markov Decision Process, enabling the development of novel Graph Attention Double Deep Q Network (GATDDQN) as the MARL backbone to capture the dynamic interactions among ride-pooling vehicles in fleet. Our approach enriches the state information for each agent with GATDDQN by leveraging a localized bipartite interdependence graph and enables a centralized global coordinator to optimize order matching and agent behavior using Integer Linear Programming (ILP). Enhanced by gradient clipping and localized graph sampling, our GATDDQN improves scalability and robustness. Furthermore, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectric Vehicles and Infrastructure · Elevator Systems and Control
MethodsSoftmax · Attention Is All You Need · Q-Learning · Gradient Clipping
