Independent Reinforcement Learning for Weakly Cooperative Multiagent   Traffic Control Problem

Chengwei Zhang; Shan Jin; Wanli Xue; Xiaofei Xie and; Shengyong Chen; Rong Chen

arXiv:2104.10917·cs.LG·April 23, 2021·1 cites

Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem

Chengwei Zhang, Shan Jin, Wanli Xue, Xiaofei Xie and, Shengyong Chen, Rong Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel independent reinforcement learning approach, CIL-DDQN, for weakly cooperative multiagent traffic control, effectively handling partial observability and improving traffic flow optimization.

Contribution

It proposes a new IRL algorithm with experience and leniency mechanisms tailored for partially observable cooperative traffic control environments.

Findings

01

CIL-DDQN outperforms existing methods in traffic management metrics.

02

The experience and lenient mechanisms enhance cooperative strategy learning.

03

The approach effectively manages partial observability in multiagent traffic control.

Abstract

The adaptive traffic signal control (ATSC) problem can be modeled as a multiagent cooperative game among urban intersections, where intersections cooperate to optimize their common goal. Recently, reinforcement learning (RL) has achieved marked successes in managing sequential decision making problems, which motivates us to apply RL in the ASTC problem. Here we use independent reinforcement learning (IRL) to solve a complex traffic cooperative control problem in this study. One of the largest challenges of this problem is that the observation information of intersection is typically partially observable, which limits the learning performance of IRL algorithms. To this, we model the traffic control problem as a partially observable weak cooperative traffic model (PO-WCTM) to optimize the overall traffic situation of a group of intersections. Different from a traditional IRL task that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zcchenvy/CIL-DDQN
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Traffic Prediction and Management Techniques

MethodsConvolution · Experience Replay · Dense Connections · Q-Learning · Double Q-learning · Deep Q-Network · Double DQN