A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control

Arash Rezaali; Pouria Yazdani; Monireh Abdoos

arXiv:2512.04653·cs.MA·March 31, 2026

A Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control

Arash Rezaali, Pouria Yazdani, Monireh Abdoos

PDF

TL;DR

This paper proposes a semi-centralized training architecture for multi-agent deep reinforcement learning in traffic signal control, balancing coordination and scalability.

Contribution

It introduces SEMI-CTDE, a region-based training approach that improves coordination and transferability in multi-intersection traffic management.

Findings

01

SEMI-CTDE outperforms fully decentralized baselines.

02

The architecture is highly transferable across different models.

03

Models achieve superior performance across various traffic conditions.

Abstract

Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic signal control (ATSC) of multiple intersections. Existing approaches typically follow either a fully centralized or a fully decentralized design. Fully centralized approaches suffer from the curse of dimensionality, and reliance on a single learning server, whereas purely decentralized approaches operate under severe partial observability and lack explicit coordination resulting in suboptimal performance. These limitations motivate region-based MARL, where the network is partitioned into smaller, tightly coupled intersections that form regions, and training is organized around these regions. This paper introduces a Semi-Centralized Training, Decentralized Execution (SEMI-CTDE) architecture for multi intersection ATSC. Within each region, SEMI-CTDE performs centralized training with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.