Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control
Jin Guo

TL;DR
This paper introduces decentralized deep multi-agent reinforcement learning algorithms for traffic signal control, demonstrating superior performance over traditional methods in simulation by reducing delays and queue lengths.
Contribution
It proposes three novel decentralized MARL schemes with communication strategies, validated through SUMO simulations, outperforming the Max Pressure algorithm in traffic management.
Findings
S2R2L converges faster and performs better than IDQL and S2RL.
All MARL schemes outperform the Max Pressure algorithm in key metrics.
S2R2L reduces traffic delay by 34.55% and queue length by 10.91%.
Abstract
In this thesis, I propose a family of fully decentralized deep multi-agent reinforcement learning (MARL) algorithms to achieve high, real-time performance in network-level traffic signal control. In this approach, each intersection is modeled as an agent that plays a Markovian Game against the other intersection nodes in a traffic signal network modeled as an undirected graph, to approach the optimal reduction in delay. Following Partially Observable Markov Decision Processes (POMDPs), there are 3 levels of communication schemes between adjacent learning agents: independent deep Q-leaning (IDQL), shared states reinforcement learning (S2RL) and a shared states & rewards version of S2RL--S2R2L. In these 3 variants of decentralized MARL schemes, individual agent trains its local deep Q network (DQN) separately, enhanced by convergence-guaranteed techniques like double DQN, prioritized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques · Autonomous Vehicle Technology and Safety
MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network
