Intelligent Traffic Light via Policy-based Deep Reinforcement Learning
Yue Zhu, Mingyu Cai, Chris Schwarz, Junchao Li, and Shaoping Xiao

TL;DR
This paper presents a policy-based deep reinforcement learning approach using PPO to optimize traffic light control in smart cities, outperforming value-based methods and adapting to variable traffic conditions.
Contribution
It introduces the use of PPO for traffic light control, compares it with other RL methods, and demonstrates robustness and adaptability to unbalanced traffic flows.
Findings
PPO outperforms DQN and DDQN in traffic light control.
Variable time interval phases improve traffic flow.
The controller is robust to environment and action disturbances.
Abstract
Intelligent traffic lights in smart cities can optimally reduce traffic congestion. In this study, we employ reinforcement learning to train the control agent of a traffic light on a simulator of urban mobility. As a difference from existing works, a policy-based deep reinforcement learning method, Proximal Policy Optimization (PPO), is utilized other than value-based methods such as Deep Q Network (DQN) and Double DQN (DDQN). At first, the obtained optimal policy from PPO is compared to those from DQN and DDQN. It is found that the policy from PPO performs better than the others. Next, instead of the fixed-interval traffic light phases, we adopt the light phases with variable time intervals, which result in a better policy to pass the traffic flow. Then, the effects of environment and action disturbances are studied to demonstrate the learning-based controller is robust. At last, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Transportation Planning and Optimization · Traffic Prediction and Management Techniques
MethodsDouble Q-learning · Entropy Regularization · Convolution · Experience Replay · Double DQN · Proximal Policy Optimization · Dense Connections · Q-Learning · Deep Q-Network
