Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem
Chengwei Zhang, Shan Jin, Wanli Xue, Xiaofei Xie and, Shengyong Chen, Rong Chen

TL;DR
This paper introduces a novel independent reinforcement learning approach, CIL-DDQN, for weakly cooperative multiagent traffic control, effectively handling partial observability and improving traffic flow optimization.
Contribution
It proposes a new IRL algorithm with experience and leniency mechanisms tailored for partially observable cooperative traffic control environments.
Findings
CIL-DDQN outperforms existing methods in traffic management metrics.
The experience and lenient mechanisms enhance cooperative strategy learning.
The approach effectively manages partial observability in multiagent traffic control.
Abstract
The adaptive traffic signal control (ATSC) problem can be modeled as a multiagent cooperative game among urban intersections, where intersections cooperate to optimize their common goal. Recently, reinforcement learning (RL) has achieved marked successes in managing sequential decision making problems, which motivates us to apply RL in the ASTC problem. Here we use independent reinforcement learning (IRL) to solve a complex traffic cooperative control problem in this study. One of the largest challenges of this problem is that the observation information of intersection is typically partially observable, which limits the learning performance of IRL algorithms. To this, we model the traffic control problem as a partially observable weak cooperative traffic model (PO-WCTM) to optimize the overall traffic situation of a group of intersections. Different from a traditional IRL task that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Traffic Prediction and Management Techniques
MethodsConvolution · Experience Replay · Dense Connections · Q-Learning · Double Q-learning · Deep Q-Network · Double DQN
