Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles
Xiaoxue Yu, Rongpeng Li, Fei Wang, Chenghui Peng, Chengchao Liang,, Zhifeng Zhao, and Honggang Zhang

TL;DR
This paper introduces RSM-MAPPO, a communication-efficient multi-agent reinforcement learning algorithm for Internet of Vehicles that reduces communication overhead while maintaining effective cooperative control.
Contribution
It proposes a novel distributed MARL method using segment mixture and theory-guided replica selection to improve communication efficiency in IoV scenarios.
Findings
RSM-MAPPO reduces communication overhead significantly.
The algorithm maintains effective policy improvement.
Simulation results confirm its effectiveness in traffic control.
Abstract
Multi-Agent Reinforcement Learning (MARL) has become a classic paradigm to solve diverse, intelligent control tasks like autonomous driving in Internet of Vehicles (IoV). However, the widely assumed existence of a central node to implement centralized federated learning-assisted MARL might be impractical in highly dynamic scenarios, and the excessive communication overheads possibly overwhelm the IoV system. Therefore, in this paper, we design a communication efficient cooperative MARL algorithm, named RSM-MAPPO, to reduce the communication overheads in a fully distributed architecture. In particular, RSM-MAPPO enhances the multi-agent Proximal Policy Optimization (PPO) by incorporating the idea of segment mixture and augmenting multiple model replicas from received neighboring policy segments. Afterwards, RSM-MAPPO adopts a theory-guided metric to regulate the selection of contributive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicular Ad Hoc Networks (VANETs) · Traffic control and management · Privacy-Preserving Technologies in Data
