A Robust and Efficient Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
Sheng-You Huang, Hsiao-Chuan Chang, Yen-Chi Chen, Ting-Han Wei, I-Hau Yeh, Sheng-Yao Kuan, Chien-Yao Wang, Hsuan-Han Lee, I-Chen Wu

TL;DR
This paper introduces a robust multi-agent reinforcement learning framework for traffic signal control that improves generalization, stability, and responsiveness in dynamic traffic environments, validated through extensive simulations.
Contribution
It presents a novel MARL framework with turning ratio randomization, exponential phase adjustment, and neighbor-based observations, enhancing robustness and scalability over existing methods.
Findings
Reduces average waiting time by over 10%.
Outperforms standard RL baselines in unseen scenarios.
Maintains high control stability and adaptability.
Abstract
Reinforcement Learning (RL) in Traffic Signal Control (TSC) faces significant hurdles in real-world deployment due to limited generalization to dynamic traffic flow variations. Existing approaches often overfit static patterns and use action spaces incompatible with driver expectations. This paper proposes a robust Multi-Agent Reinforcement Learning (MARL) framework validated in the Vissim traffic simulator. The framework integrates three mechanisms: (1) Turning Ratio Randomization, a training strategy that exposes agents to dynamic turning probabilities to enhance robustness against unseen scenarios; (2) a stability-oriented Exponential Phase Duration Adjustment action space, which balances responsiveness and precision through cyclical, exponential phase adjustments; and (3) a Neighbor-Based Observation scheme utilizing the MAPPO algorithm with Centralized Training with Decentralized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques · Vehicular Ad Hoc Networks (VANETs)
