Diagnosing Reinforcement Learning for Traffic Signal Control
Guanjie Zheng, Xinshi Zang, Nan Xu, Hua Wei, Zhengyao Yu, Vikash, Gayah, Kai Xu, Zhenhui Li

TL;DR
This paper re-examines reinforcement learning for traffic signal control, proposing a simple, theoretically grounded method that guarantees travel time minimization and outperforms existing complex approaches.
Contribution
It introduces LIT, a simple and theoretically supported RL method with concise state and reward design for traffic signal control.
Findings
LIT significantly outperforms state-of-the-art methods.
Simple state and reward design is effective for traffic signal control.
Theoretically guarantees travel time minimization.
Abstract
With the increasing availability of traffic data and advance of deep reinforcement learning techniques, there is an emerging trend of employing reinforcement learning (RL) for traffic signal control. A key question for applying RL to traffic signal control is how to define the reward and state. The ultimate objective in traffic signal control is to minimize the travel time, which is difficult to reach directly. Hence, existing studies often define reward as an ad-hoc weighted linear combination of several traffic measures. However, there is no guarantee that the travel time will be optimized with the reward. In addition, recent RL approaches use more complicated state (e.g., image) in order to describe the full traffic situation. However, none of the existing studies has discussed whether such a complex state representation is necessary. This extra complexity may lead to significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques · Traffic and Road Safety
