Multi-Agent Congestion Cost Minimization With Linear Function   Approximations

Prashant Trivedi; Nandyala Hemachandra

arXiv:2301.10993·cs.LG·February 24, 2023

Multi-Agent Congestion Cost Minimization With Linear Function Approximations

Prashant Trivedi, Nandyala Hemachandra

PDF

Open Access 1 Repo

TL;DR

This paper introduces a decentralized multi-agent reinforcement learning algorithm, MACCM, that minimizes congestion costs in network traversal using linear function approximations, achieving sub-linear regret and preserving agent privacy.

Contribution

The paper proposes the MACCM algorithm with linear function approximations for decentralized congestion cost minimization, including convergence and regret analysis, and validates it on a network example.

Findings

01

MACCM achieves sub-linear regret in multi-agent congestion minimization.

02

The optimal policy balances node staying costs and congestion costs.

03

Average regret is near zero for 2 and 3 agents.

Abstract

This work considers multiple agents traversing a network from a source node to the goal node. The cost to an agent for traveling a link has a private as well as a congestion component. The agent's objective is to find a path to the goal node with minimum overall cost in a decentralized way. We model this as a fully decentralized multi-agent reinforcement learning problem and propose a novel multi-agent congestion cost minimization (MACCM) algorithm. Our MACCM algorithm uses linear function approximations of transition probabilities and the global cost function. In the absence of a central controller and to preserve privacy, agents communicate the cost function parameters to their neighbors via a time-varying communication network. Moreover, each agent maintains its estimate of the global state-action value, which is updated via a multi-agent extended value iteration (MAEVI) sub-routine.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pra1trivedi/maccm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Privacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing