Multi-agent Reinforcement Learning for Networked System Control

Tianshu Chu; Sandeep Chinchali; Sachin Katti

arXiv:2004.01339·cs.LG·April 27, 2020·62 cites

Multi-agent Reinforcement Learning for Networked System Control

Tianshu Chu, Sandeep Chinchali, Sachin Katti

PDF

Open Access 1 Repo

TL;DR

This paper advances multi-agent reinforcement learning for networked control systems by introducing a spatial discount factor and a novel communication protocol, NeurComm, improving training stability and performance in traffic and cruise control scenarios.

Contribution

It proposes a new NMARL framework with a spatial discount factor and NeurComm protocol, enhancing learning stability and communication efficiency.

Findings

01

Spatial discount factor improves learning curves.

02

NeurComm outperforms existing protocols.

03

Enhanced control performance in traffic and cruise scenarios.

Abstract

This paper considers multi-agent reinforcement learning (MARL) in networked system control. Specifically, each agent learns a decentralized control policy based on local observations and messages from connected neighbors. We formulate such a networked MARL (NMARL) problem as a spatiotemporal Markov decision process and introduce a spatial discount factor to stabilize the training of each local agent. Further, we propose a new differentiable communication protocol, called NeurComm, to reduce information loss and non-stationarity in NMARL. Based on experiments in realistic NMARL scenarios of adaptive traffic signal control and cooperative adaptive cruise control, an appropriate spatial discount factor effectively enhances the learning curves of non-communicative MARL algorithms, while NeurComm outperforms existing communication protocols in both learning efficiency and control performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cts198859/deeprl_network
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Reinforcement Learning in Robotics · Smart Grid Security and Resilience