Scalable spectral representations for multi-agent reinforcement learning   in network MDPs

Zhaolin Ren; Runyu Zhang; Bo Dai; Na Li

arXiv:2410.17221·cs.MA·November 19, 2024

Scalable spectral representations for multi-agent reinforcement learning in network MDPs

Zhaolin Ren, Runyu Zhang, Bo Dai, Na Li

PDF

Open Access

TL;DR

This paper introduces scalable spectral local representations for network MDPs in multi-agent reinforcement learning, enabling efficient learning by exploiting network dynamics and providing convergence guarantees.

Contribution

It derives scalable spectral local representations for network MDPs and develops a convergent algorithmic framework for continuous state-action spaces.

Findings

01

Effective in two benchmark problems

02

Outperforms generic function approximation methods

03

Provides theoretical convergence guarantees

Abstract

Network Markov Decision Processes (MDPs), a popular model for multi-agent control, pose a significant challenge to efficient learning due to the exponential growth of the global state-action space with the number of agents. In this work, utilizing the exponential decay property of network dynamics, we first derive scalable spectral local representations for network MDPs, which induces a network linear subspace for the local $Q$ -function of each agent. Building on these local spectral representations, we design a scalable algorithmic framework for continuous state-action network MDPs, and provide end-to-end guarantees for the convergence of our algorithm. Empirically, we validate the effectiveness of our scalable representation-based approach on two benchmark problems, and demonstrate the advantages of our approach over generic function approximation approaches to representing the local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Distributed Control Multi-Agent Systems · Mobile Agent-Based Network Management

MethodsExponential Decay