Improving the generalizability and robustness of large-scale traffic signal control
Tianyu Shi, Francois-Xavier Devailly, Denis Larocque, Laurent, Charlin

TL;DR
This paper enhances traffic signal control by combining distributional and standard reinforcement learning, improving robustness to missing data and generalization across diverse road networks and traffic conditions.
Contribution
It introduces a novel ensemble of distributional and vanilla RL methods using IQN for large-scale traffic signal control, improving robustness and transferability.
Findings
Ensemble approach outperforms existing methods in robustness.
Improved zero-shot transfer to new networks.
Enhanced performance under missing sensor data.
Abstract
A number of deep reinforcement-learning (RL) approaches propose to control traffic signals. In this work, we study the robustness of such methods along two axes. First, sensor failures and GPS occlusions create missing-data challenges and we show that recent methods remain brittle in the face of these missing data. Second, we provide a more systematic study of the generalization ability of RL methods to new networks with different traffic regimes. Again, we identify the limitations of recent approaches. We then propose using a combination of distributional and vanilla reinforcement learning through a policy ensemble. Building upon the state-of-the-art previous model which uses a decentralized approach for large-scale traffic signal control with graph convolutional networks (GCNs), we first learn models using a distributional reinforcement learning (DisRL) approach. In particular, we use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques · Transportation Planning and Optimization
MethodsGreedy Policy Search
