A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control
Arash Rezaali, Pouria Yazdani, Monireh Abdoos

TL;DR
This paper proposes a semi-centralized training architecture for multi-agent deep reinforcement learning in traffic signal control, balancing coordination and scalability.
Contribution
It introduces SEMI-CTDE, a region-based training approach that improves coordination and transferability in multi-intersection traffic management.
Findings
SEMI-CTDE outperforms fully decentralized baselines.
The architecture is highly transferable across different models.
Models achieve superior performance across various traffic conditions.
Abstract
Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic signal control (ATSC) of multiple intersections. Existing approaches typically follow either a fully centralized or a fully decentralized design. Fully centralized approaches suffer from the curse of dimensionality, and reliance on a single learning server, whereas purely decentralized approaches operate under severe partial observability and lack explicit coordination resulting in suboptimal performance. These limitations motivate region-based MARL, where the network is partitioned into smaller, tightly coupled intersections that form regions, and training is organized around these regions. This paper introduces a Semi-Centralized Training, Decentralized Execution (SEMI-CTDE) architecture for multi intersection ATSC. Within each region, SEMI-CTDE performs centralized training with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
