Network-Constrained Policy Optimization for Adaptive Multi-agent Vehicle Routing
Fazel Arasteh, Arian Haghparast, and Manos Papagelis

TL;DR
This paper introduces a multi-agent reinforcement learning framework for adaptive vehicle routing in urban networks, improving traffic flow and reducing congestion through decentralized and hierarchical models.
Contribution
It proposes a novel decentralized MARL model (AN) and a scalable hierarchical extension (HHAN) that coordinate vehicle routing using graph attention and centralized training.
Findings
AN reduces travel time compared to SPF and baseline methods.
HHAN scales to large networks with hundreds of intersections.
HHAN achieves up to 15.9% improvement in heavy traffic conditions.
Abstract
Traffic congestion in urban road networks leads to longer trip times and higher emissions, especially during peak periods. While the Shortest Path First (SPF) algorithm is optimal for a single vehicle in a static network, it performs poorly in dynamic, multi-vehicle settings, often worsening congestion by routing all vehicles along identical paths. We address dynamic vehicle routing through a multi-agent reinforcement learning (MARL) framework for coordinated, network-aware fleet navigation. We first propose Adaptive Navigation (AN), a decentralized MARL model where each intersection agent provides routing guidance based on (i) local traffic and (ii) neighborhood state modeled using Graph Attention Networks (GAT). To improve scalability in large networks, we further propose Hierarchical Hub-based Adaptive Navigation (HHAN), an extension of AN that assigns agents only to key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
