Characterizing Speed Performance of Multi-Agent Reinforcement Learning

Samuel Wiggins; Yuan Meng; Rajgopal Kannan; Viktor Prasanna

arXiv:2309.07108·cs.LG·September 14, 2023

Characterizing Speed Performance of Multi-Agent Reinforcement Learning

Samuel Wiggins, Yuan Meng, Rajgopal Kannan, Viktor Prasanna

PDF

TL;DR

This paper analyzes the speed performance of multi-agent reinforcement learning algorithms, emphasizing latency and throughput, and provides a systematic analysis of bottlenecks on CPU platforms to guide future acceleration efforts.

Contribution

It introduces a taxonomy for MARL algorithms from an acceleration perspective and systematically analyzes three state-of-the-art algorithms' performance bottlenecks.

Findings

01

Identified key bottlenecks in MARL algorithms on CPU platforms.

02

Highlighted the importance of latency-bounded throughput as a performance metric.

03

Suggested opportunities for parallelization and acceleration in MARL implementations.

Abstract

Multi-Agent Reinforcement Learning (MARL) has achieved significant success in large-scale AI systems and big-data applications such as smart grids, surveillance, etc. Existing advancements in MARL algorithms focus on improving the rewards obtained by introducing various mechanisms for inter-agent cooperation. However, these optimizations are usually compute- and memory-intensive, thus leading to suboptimal speed performance in end-to-end training time. In this work, we analyze the speed performance (i.e., latency-bounded throughput) as the key metric in MARL implementations. Specifically, we first introduce a taxonomy of MARL algorithms from an acceleration perspective categorized by (1) training scheme and (2) communication method. Using our taxonomy, we identify three state-of-the-art MARL algorithms - Multi-Agent Deep Deterministic Policy Gradient (MADDPG), Target-oriented…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings