HALO: Hierarchical Reinforcement Learning for Large-Scale Adaptive Traffic Signal Control
Yaqiao Zhu, Hongkai Wen, Geyong Min, Man Luo

TL;DR
HALO introduces a hierarchical reinforcement learning framework for large-scale adaptive traffic signal control, effectively balancing global coordination and local decision-making to improve traffic flow in complex urban networks.
Contribution
The paper presents a novel hierarchical RL approach with Transformer-LSTM encoders and an adversarial goal-setting mechanism for scalable, coordinated traffic signal control.
Findings
HALO outperforms state-of-the-art methods in large-scale traffic networks.
HALO reduces average travel time by up to 6.8%.
HALO maintains robust performance across diverse traffic scenarios.
Abstract
Adaptive traffic signal control (ATSC) is essential for mitigating urban congestion in modern smart cities, where traffic infrastructure is evolving into interconnected Web-of-Things (WoT) environments with thousands of sensing-and-control nodes. However, existing methods face a critical scalability-coordination tradeoff: centralized approaches optimize global objectives but become computationally intractable at city scale, while decentralized multi-agent methods scale efficiently yet lack network-level coherence, resulting in suboptimal performance. In this paper, we present HALO, a hierarchical reinforcement learning framework that addresses this tradeoff for large-scale ATSC. HALO decouples decision-making into two levels: a high-level global guidance policy employs Transformer-LSTM encoders to model spatio-temporal dependencies across the entire network and broadcast compact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Electrostatic Discharge in Electronics · Adversarial Robustness in Machine Learning
