OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control
Rohit Bokade, Xiaoning Jin

TL;DR
OffLight is an offline multi-agent reinforcement learning framework designed for traffic signal control, effectively handling heterogeneous data to improve traffic flow and reduce congestion in urban environments.
Contribution
It introduces a novel offline MARL framework with importance sampling, prioritized experience replay, and a GMM-VGAE to address heterogeneous behavior policies in traffic datasets.
Findings
Achieves up to 7.8% reduction in average travel time.
Reduces queue length by 11.2%.
Outperforms existing offline RL methods in real-world scenarios.
Abstract
Efficient traffic control (TSC) is essential for urban mobility, but traditional systems struggle to handle the complexity of real-world traffic. Multi-agent Reinforcement Learning (MARL) offers adaptive solutions, but online MARL requires extensive interactions with the environment, making it costly and impractical. Offline MARL mitigates these challenges by using historical traffic data for training but faces significant difficulties with heterogeneous behavior policies in real-world datasets, where mixed-quality data complicates learning. We introduce OffLight, a novel offline MARL framework designed to handle heterogeneous behavior policies in TSC datasets. To improve learning efficiency, OffLight incorporates Importance Sampling (IS) to correct for distributional shifts and Return-Based Prioritized Sampling (RBPS) to focus on high-quality experiences. OffLight utilizes a Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Traffic Prediction and Management Techniques
MethodsEmirates Airlines Office in Dubai · Focus
