Multi-hop Upstream Anticipatory Traffic Signal Control with Deep   Reinforcement Learning

Xiaocan Li; Xiaoyu Wang; Ilia Smirnov; Scott Sanner; Baher Abdulhai

arXiv:2411.07271·cs.LG·January 20, 2025

Multi-hop Upstream Anticipatory Traffic Signal Control with Deep Reinforcement Learning

Xiaocan Li, Xiaoyu Wang, Ilia Smirnov, Scott Sanner, Baher Abdulhai

PDF

Open Access

TL;DR

This paper introduces a multi-hop upstream pressure concept for deep reinforcement learning-based traffic signal control, enabling broader spatial awareness and significantly reducing network delays in urban traffic management.

Contribution

It proposes a novel multi-hop upstream pressure metric based on Markov chain theory, enhancing the spatial awareness of traffic signal control agents in deep reinforcement learning.

Findings

01

Significantly reduces overall network delay in simulations.

02

Outperforms pressure-based methods focusing only on immediate upstream links.

03

Demonstrates effectiveness in both synthetic and real-world Toronto scenarios.

Abstract

Coordination in traffic signal control is crucial for managing congestion in urban networks. Existing pressure-based control methods focus only on immediate upstream links, leading to suboptimal green time allocation and increased network delays. However, effective signal control inherently requires coordination across a broader spatial scope, as the effect of upstream traffic should influence signal control decisions at downstream intersections, impacting a large area in the traffic network. Although agent communication using neural network-based feature extraction can implicitly enhance spatial awareness, it significantly increases the learning complexity, adding an additional layer of difficulty to the challenging task of control in deep reinforcement learning. To address the issue of learning complexity and myopic traffic pressure definition, our work introduces a novel concept…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management

MethodsFocus