Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers
Xiaoyu Wang, Ayal Taitler, Scott Sanner, Baher Abdulhai

TL;DR
This paper introduces Transformer-based controllers for adaptive traffic signal control to effectively handle partial observability, resulting in improved traffic flow and coordination in urban transportation systems.
Contribution
It presents a novel integration of Transformer architectures into ATSC systems to address partial observability challenges, enhancing policy learning and coordination.
Findings
Transformer models capture significant historical information.
Improved traffic flow and coordination in real-world scenarios.
Enhanced training efficiency and effectiveness.
Abstract
Efficient traffic signal control is essential for managing urban transportation, minimizing congestion, and improving safety and sustainability. Reinforcement Learning (RL) has emerged as a promising approach to enhancing adaptive traffic signal control (ATSC) systems, allowing controllers to learn optimal policies through interaction with the environment. However, challenges arise due to partial observability (PO) in traffic networks, where agents have limited visibility, hindering effectiveness. This paper presents the integration of Transformer-based controllers into ATSC systems to address PO effectively. We propose strategies to enhance training efficiency and effectiveness, demonstrating improved coordination capabilities in real-world scenarios. The results showcase the Transformer-based model's ability to capture significant information from historical observations, leading to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Traffic Prediction and Management Techniques
MethodsLinear Layer · Multi-Head Attention · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dropout
