Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for   Traffic Signal Control

Yuli Zhang; Shangbo Wang; Dongyao Jia; Pengfei Fan; Ruiyuan Jiang,; Hankang Gu; and Andy H.F. Chow

arXiv:2502.16608·cs.AI·February 25, 2025

Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control

Yuli Zhang, Shangbo Wang, Dongyao Jia, Pengfei Fan, Ruiyuan Jiang,, Hankang Gu, and Andy H.F. Chow

PDF

Open Access

TL;DR

This paper explores how multi-agent reinforcement learning can adaptively optimize traffic signals by considering dependency dynamics among intersections, proposing a novel update strategy that improves convergence without losing optimality.

Contribution

It introduces a dependency-aware dynamic parameter update strategy for deep Q-networks in multi-agent traffic control, bridging centralized and decentralized RL approaches based on congestion conditions.

Findings

01

DQN-DPUS accelerates convergence in traffic signal control tasks.

02

The strategy maintains optimal exploration while improving learning speed.

03

Theoretical analysis confirms the effectiveness of dependency-based updates.

Abstract

Reinforcement learning (RL) emerges as a promising data-driven approach for adaptive traffic signal control (ATSC) in complex urban traffic networks, with deep neural networks substantially augmenting its learning capabilities. However, centralized RL becomes impractical for ATSC involving multiple agents due to the exceedingly high dimensionality of the joint action space. Multi-agent RL (MARL) mitigates this scalability issue by decentralizing control to local RL agents. Nevertheless, this decentralized method introduces new challenges: the environment becomes partially observable from the perspective of each local agent due to constrained inter-agent communication. Both centralized RL and MARL exhibit distinct strengths and weaknesses, particularly under heavy intersectional traffic conditions. In this paper, we justify that MARL can achieve the optimal global Q-value by separating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings