Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control
Yuli Zhang, Shangbo Wang, Dongyao Jia, Pengfei Fan, Ruiyuan Jiang,, Hankang Gu, and Andy H.F. Chow

TL;DR
This paper explores how multi-agent reinforcement learning can adaptively optimize traffic signals by considering dependency dynamics among intersections, proposing a novel update strategy that improves convergence without losing optimality.
Contribution
It introduces a dependency-aware dynamic parameter update strategy for deep Q-networks in multi-agent traffic control, bridging centralized and decentralized RL approaches based on congestion conditions.
Findings
DQN-DPUS accelerates convergence in traffic signal control tasks.
The strategy maintains optimal exploration while improving learning speed.
Theoretical analysis confirms the effectiveness of dependency-based updates.
Abstract
Reinforcement learning (RL) emerges as a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, with deep neural networks substantially augmenting its learning capabilities. However, centralized RL becomes impractical for ATSC involving multiple agents due to the exceedingly high dimensionality of the joint action space. Multi-agent RL (MARL) mitigates this scalability issue by decentralizing control to local RL agents. Nevertheless, this decentralized method introduces new challenges: the environment becomes partially observable from the perspective of each local agent due to constrained inter-agent communication. Both centralized RL and MARL exhibit distinct strengths and weaknesses, particularly under heavy intersectional traffic conditions. In this paper, we justify that MARL can achieve the optimal global Q-value by separating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
