Large-Scale Traffic Signal Control by a Nash Deep Q-network Approach
Yuli.Zhang, Shangbo.Wang, Ruiyuan.Jiang

TL;DR
This paper introduces an off-policy Nash Deep Q-Network (OPNDQN) for large-scale traffic signal control, effectively addressing the challenges of high-dimensional state-action spaces and non-stationarity in multi-agent reinforcement learning.
Contribution
The paper proposes a novel OPNDQN algorithm that finds Nash equilibria among neighboring intersections, improving convergence and scalability in large traffic networks.
Findings
OPNDQN outperforms existing MARL methods in simulations.
It achieves lower average queue lengths and waiting times.
The convergence rate of OPNDQN is higher than traditional approaches.
Abstract
Reinforcement Learning (RL) is currently one of the most commonly used techniques for traffic signal control (TSC), which can adaptively adjusted traffic signal phase and duration according to real-time traffic data. However, a fully centralized RL approach is beset with difficulties in a multi-network scenario because of exponential growth in state-action space with increasing intersections. Multi-agent reinforcement learning (MARL) can overcome the high-dimension problem by employing the global control of each local RL agent, but it also brings new challenges, such as the failure of convergence caused by the non-stationary Markov Decision Process (MDP). In this paper, we introduce an off-policy nash deep Q-Network (OPNDQN) algorithm, which mitigates the weakness of both fully centralized and MARL approaches. The OPNDQN algorithm solves the problem that traditional algorithms cannot be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques · Transportation Planning and Optimization
