Distributed Value Decomposition Networks with Networked Agents

Guilherme S. Varela; Alberto Sardinha; Francisco S. Melo

arXiv:2502.07635·cs.LG·February 12, 2025

Distributed Value Decomposition Networks with Networked Agents

Guilherme S. Varela, Alberto Sardinha, Francisco S. Melo

PDF

Open Access

TL;DR

This paper introduces distributed value decomposition networks (DVDN) for multi-agent reinforcement learning, enabling decentralized training with communication among agents, and demonstrates their effectiveness across multiple tasks and environments.

Contribution

The paper proposes DVDN and DVDN (GT), novel algorithms for decentralized multi-agent RL that do not require centralized training, addressing communication constraints.

Findings

01

DVDN approximates centralized value decomposition performance.

02

Both algorithms perform well despite communication-induced information loss.

03

Validated across ten tasks in three environments.

Abstract

We investigate the problem of distributed training under partial observability, whereby cooperative multi-agent reinforcement learning agents (MARL) maximize the expected cumulative joint reward. We propose distributed value decomposition networks (DVDN) that generate a joint Q-function that factorizes into agent-wise Q-functions. Whereas the original value decomposition networks rely on centralized training, our approach is suitable for domains where centralized training is not possible and agents must learn by interacting with the physical environment in a decentralized manner while communicating with their peers. DVDN overcomes the need for centralized training by locally estimating the shared objective. We contribute with two innovative algorithms, DVDN and DVDN (GT), for the heterogeneous and homogeneous agents settings respectively. Empirically, both algorithms approximate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation