Communication-Efficient Cooperative Multi-Agent PPO via Regulated   Segment Mixture in Internet of Vehicles

Xiaoxue Yu; Rongpeng Li; Fei Wang; Chenghui Peng; Chengchao Liang,; Zhifeng Zhao; and Honggang Zhang

arXiv:2308.04198·cs.MA·August 9, 2023

Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles

Xiaoxue Yu, Rongpeng Li, Fei Wang, Chenghui Peng, Chengchao Liang,, Zhifeng Zhao, and Honggang Zhang

PDF

Open Access

TL;DR

This paper introduces RSM-MAPPO, a communication-efficient multi-agent reinforcement learning algorithm for Internet of Vehicles that reduces communication overhead while maintaining effective cooperative control.

Contribution

It proposes a novel distributed MARL method using segment mixture and theory-guided replica selection to improve communication efficiency in IoV scenarios.

Findings

01

RSM-MAPPO reduces communication overhead significantly.

02

The algorithm maintains effective policy improvement.

03

Simulation results confirm its effectiveness in traffic control.

Abstract

Multi-Agent Reinforcement Learning (MARL) has become a classic paradigm to solve diverse, intelligent control tasks like autonomous driving in Internet of Vehicles (IoV). However, the widely assumed existence of a central node to implement centralized federated learning-assisted MARL might be impractical in highly dynamic scenarios, and the excessive communication overheads possibly overwhelm the IoV system. Therefore, in this paper, we design a communication efficient cooperative MARL algorithm, named RSM-MAPPO, to reduce the communication overheads in a fully distributed architecture. In particular, RSM-MAPPO enhances the multi-agent Proximal Policy Optimization (PPO) by incorporating the idea of segment mixture and augmenting multiple model replicas from received neighboring policy segments. Afterwards, RSM-MAPPO adopts a theory-guided metric to regulate the selection of contributive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicular Ad Hoc Networks (VANETs) · Traffic control and management · Privacy-Preserving Technologies in Data