A Communication-Efficient Decentralized Actor-Critic Algorithm

Xiaoxing Ren; Nicola Bastianello; Thomas Parisini; Andreas A. Malikopoulos

arXiv:2510.19199·cs.LG·October 23, 2025

A Communication-Efficient Decentralized Actor-Critic Algorithm

Xiaoxing Ren, Nicola Bastianello, Thomas Parisini, Andreas A. Malikopoulos

PDF

Open Access

TL;DR

This paper introduces a decentralized actor-critic reinforcement learning algorithm that reduces communication among agents through local updates, with proven convergence and practical validation in cooperative control tasks.

Contribution

The paper proposes a novel communication-efficient decentralized actor-critic algorithm with finite-time convergence analysis and neural network approximation considerations.

Findings

01

Achieves $ ilde{O}(rac{1}{ au \, ext{epsilon}^3})$ sample complexity.

02

Communication complexity is reduced to $ ilde{O}(rac{1}{ au \, ext{epsilon}})$.

03

Numerical experiments validate theoretical results in cooperative control.

Abstract

In this paper, we study the problem of reinforcement learning in multi-agent systems where communication among agents is limited. We develop a decentralized actor-critic learning framework in which each agent performs several local updates of its policy and value function, where the latter is approximated by a multi-layer neural network, before exchanging information with its neighbors. This local training strategy substantially reduces the communication burden while maintaining coordination across the network. We establish finite-time convergence analysis for the algorithm under Markov-sampling. Specifically, to attain the $ε$ -accurate stationary point, the sample complexity is of order $O (ε^{- 3})$ and the communication complexity is of order $O (ε^{- 1} τ^{- 1})$ , where tau denotes the number of local training steps. We also show how…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Neural Networks and Reservoir Computing