Multi-Agent Congestion Cost Minimization With Linear Function Approximations
Prashant Trivedi, Nandyala Hemachandra

TL;DR
This paper introduces a decentralized multi-agent reinforcement learning algorithm, MACCM, that minimizes congestion costs in network traversal using linear function approximations, achieving sub-linear regret and preserving agent privacy.
Contribution
The paper proposes the MACCM algorithm with linear function approximations for decentralized congestion cost minimization, including convergence and regret analysis, and validates it on a network example.
Findings
MACCM achieves sub-linear regret in multi-agent congestion minimization.
The optimal policy balances node staying costs and congestion costs.
Average regret is near zero for 2 and 3 agents.
Abstract
This work considers multiple agents traversing a network from a source node to the goal node. The cost to an agent for traveling a link has a private as well as a congestion component. The agent's objective is to find a path to the goal node with minimum overall cost in a decentralized way. We model this as a fully decentralized multi-agent reinforcement learning problem and propose a novel multi-agent congestion cost minimization (MACCM) algorithm. Our MACCM algorithm uses linear function approximations of transition probabilities and the global cost function. In the absence of a central controller and to preserve privacy, agents communicate the cost function parameters to their neighbors via a time-varying communication network. Moreover, each agent maintains its estimate of the global state-action value, which is updated via a multi-agent extended value iteration (MAEVI) sub-routine.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · Privacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing
